(54) NOVEL β1→4 N-ACETYLGLUCOSAMINYLTRANSFERASE AND GENE ENCODING THE SAME

(57) The present invention relates to a novel enzyme having β1→4 N-acetylglucosaminyltransferase (GnT-IV) activity; a gene encoding the enzyme; a recombinant DNA comprising the gene; a host cell comprising the recombinant DNA; a method for producing an enzyme protein having GnT-IV activity comprising culturing the host cell in a medium; and a saccharide in which the sugar chain is modified using a GnT-IV.

According to the present invention, a novel GnT-IV, a method for producing the enzyme and a gene coding for the enzyme are provided. With the GnT-IV of the present invention, it has become possible to produce a saccharide having a branching structure which could not be formed with conventional glycosyltransferases. Thus, the GnT-IV of the invention is useful not only for producing or improving glycoconjugate type pharmaceuticals, reagents and foods, but also for modifying the sugar chain structure of any biopolymer.

Description

Technical Field 

[0001] The present invention relates to a novel N-acetylglucosaminyltransferase (GlcNAc transferase) which recognizes a specific sugar chain structure in a saccharide and introduces thereinto a GlcNAc β1→4 branching structure.

Background Art 
1. Glycoproteins

[0002] Most proteins occurring in nature are not simple proteins composed of amino acids alone, but "mature" proteins having sugar chains and other substances such as phosphates and lipids attached thereto. Therefore, the development of simple protein-type products produced using Escherichia coli as a host has involved various problems because such products lack the maturation process of proteins. Since, with a few exceptions, all secretion-type physiologically active proteins (e.g. cytokines) are glycoproteins, the function and role of sugar chains have attracted attention as the most important point in the development of biological pharmaceuticals.

[0003] Sugar chains in glycoproteins are classified roughly into the Asn-linked type, mucin type, O-linked GlcNAc type, GPI anchor type and proteoglycan type [Makoto Takeuchi, "Glycobiology Series 5: Glycotechnology", Kobata, Hakomori and Nagai (eds.), Kodansha Scientific Co., (1994), 191-208]. Each of these types of sugar chains has its own biosynthesis pathway and a discrete physiological function. Asn-linked sugar chains are distributed widely in molds, yeasts, insects, plants and animals. The basic biosynthesis pathway for Asn-linked sugar chains is conserved across species (Fig. 1). A sugar chain(s) characteristic of a specific species is(are) formed on the outer side (called the "non-reducing terminal side") of the core sugar chain moiety which is common in the biosynthesis of Asn-linked sugar chains. A mannan-type sugar chain in which α1,3- and α1,2-branching mannose residues attach to a main chain extending via α1,6 linkages is a sugar chain structure characteristic of fungi such as yeasts (see Panel a in Fig. 2) [Hiroshi Nakajima, Sugar Chain Technology, Industry Survey Association (1992), 384-397]. On the other hand, in insects, plants and animals, extension of mannose residues is not observed; instead, a high mannose type sugar chain is formed, which is a sugar chain transferred from a dolichol intermediate and only trimmed (see Panel c in Fig. 2). A unique structure having characteristic xylose or the like (see Panel b in Fig. 2) is also observed in insects, plants and mollusks. In animals, characteristic sugar chain structures such as the complex type sugar chain (Panel e in Fig. 2) and the hybrid type sugar chain (Panel d in Fig. 2) are observed; in the former, GlcNAc branching structures are formed in a once-trimmed sugar chain, and addition of other kinds of monosaccharides such as galactose and sialic acid forms complicated structures; in the latter, both a complex type sugar chain and a high mannose type sugar chain are present [Kiyoshi Furukawa, Sugar Chain Technology, Industry Survey Association (1992), 64-75].

[0004] Such sugar chains as described above are conferred on most cell surface proteins and secretion proteins, and are thought to play important roles which determine the natures and properties of cells and proteins. Above all, the portion of a sugar chain structure which forms branches elongating like antennae from the common core sugar chain is called a sugar chain branching structure. This structure is believed to have one function of giving an organism recognition ligand (i.e., the end portion of the sugar chain) a high degree of freedom, thereby providing chances for multipoint recognition, and another function of maximizing the protection of the protein moiety by greatly increasing the space-occupying volume (Takeuchi et al., supra). Therefore, by controlling the branching structure of sugar chains, it is possible to modify the physiological functions of glycoproteins, such as in vivo stability, in vivo kinetics and organ-targeting properties, in various ways. In view of this, technology to control the branching structures of sugar chains is expected to serve as next-generation biotechnology for the development of glycoprotein-type pharmaceuticals which are "tender to humans".

2. Physiological Significance of Glycoprotein Sugar Chains 

[0005] Sugar chains of secretion type glycoproteins exhibit excellent functions in biosynthesis, intracellular sorting, masking of antigenicity, in vivo stability and organ-targeting properties of glycoproteins. Sugar chains of cell surface proteins are known to change in response to changes in cells (such as differentiation, change to a morbid state, canceration). In particular, it has been reported that there is a close relation between the metastasis of cancer and the branching structure of sugar chains.


(1) Masking of Antigenicity 

[0006] It is considered that sugar chains have a high degree of freedom in terms of steric structure and thus move freely like propellers. Therefore, protein molecules such as proteases and antibodies against proteins, which have no affinity for sugar chains, are shaken off by the sugar chains and thus cannot gain access to the protein moiety. As a result, even if there is antigenicity in the peptide moiety near the sugar chain binding site, antibody molecules cannot gain access to the peptide moiety. Thus, an antigen-antibody reaction hardly occurs. Further, when a glycoprotein has been captured by a macrophage and the degradation products are presented as antigen, receptors have difficulty gaining access to the peptides around the sugar chain binding site. Thus, antigenic stimulation hardly occurs. Actually, it is reported that when sugar chains have been introduced into the central portion of the antigenic peptide of ovalbumin lysozyme, the binding of MHC class II molecules to the antigen is remarkably inhibited [Mouritsen, S., Meldal, M., Christiansen-Brams, I., Elsner, H. and Werdelin, O., Eur. J. Immunol., (1994), 24, 1066-1072]. The effect of such masking of antigenicity becomes greater as the volume occupied by sugar chains increases. Thus, it is considered that the development of a branching structure contributes greatly to the effect of such masking.



(2) In Vivo Stability

[0007] With respect to erythropoietin, which is the first glycoprotein-type pharmaceutical ever produced from a transgenic animal cell as a host, the functions of its sugar chains have been studied thoroughly. As a result, it has been shown that the sugar chains of erythropoietin inhibit the binding of erythropoietin to its receptor but make a decisive contribution to the retention of the active structure and the improvement of in vivo kinetics; as a whole, the sugar chains have been shown to be essential for expression of the pharmacological activity of erythropoietin [Takeuchi, M. and Kobata, A., Glycobiology (1991), 1, 337-346]. In particular, a strong correlation between the number of antennae in sugar chains and the pharmacological effect of erythropoietin has been found, and thus the importance of its branching structure (a branching structure formed by GlcNAc residues attaching to the core sugar chain), which had never attracted attention, has been made clear for the first time [Takeuchi, M., Inoue, N., Strickland, T. W., Kobata, M., Wada, M., Shimizu, R., Hoshi, S., Kozutsumi, H., Takasaki, S. and Kobata, A., Proc. Natl. Acad. Sci. USA, (1989), 86, 7819-22]. The major cause of the above phenomenon is explained as follows: erythropoietin without a developed branching structure is cleared rather rapidly in the kidney and, as a result, the in vivo residence time of such erythropoietin becomes shorter [Misaizu, T., Matsuki, S., Strickland, T. W., Takeuchi, M., Kobata, A. and Takasaki, S., Blood, (1995), 86, 4097-4104].



(3) Organ Targeting Property

[0008] Most biological tissues have lectin-like receptors and use them in cell-cell interactions or to take up glycoproteins from blood. The asialoprotein-binding lectin in liver is a representative example of a clearance system for aged glycoproteins [Toshihiro Kawasaki, Sugar Chain Technology, Industry Survey Association (1992), 125-136]. In addition, selectin contained in vascular endothelial cells, platelets and leucocytes (Kawasaki, supra) and the lectin receptor present on the surface of macrophages and NK cells (Kawasaki, supra) are well known. Furthermore, not only glycoproteins but also cells are known to gather in a specific tissue using sugar chains as ligands. Cases of the homing of bone marrow cells [Tatsuo Irimura, "Glycobiology Series 3: Glycobiology in Cell Society", Katsutaka Nagai, Senichiro Hakomori and Akira Kobata (eds.), Kodansha Scientific Co., (1993), 127-175] and the recruiting of neutrophils to inflammatory sites (Irimura, supra) have been examined in detail. Putting all these things together, it can reasonably be assumed that glycoproteins and cells have, via their sugar chain structures, a targeting property toward specific organs or tissues presenting a lectin receptor in blood circulation, although such a targeting system is not found in all organs. This means that drug delivery by means of sugar chains is possible. In such drug delivery, the affinity of lectin for sugar chains is greatly influenced by the degree of freedom and the number of sugar chain ligands. Therefore, the branching structure of sugar chains will be the most important point in such drug delivery.

(4) Correlation between Cells' Change into Morbid State and Sugar Chain Branched Structure thereof [Junko Kato, Naoko Suzuki, "Sugar Chain Technology and Development of Pharmaceutical", Foundation for the Relief and Study of Injury Caused by Pharmaceutical Side Effect (ed.), Yakugyo-Jiho-Sha, (1994), 107-13214]

[0009] Since a plant lectin called L-PHA was developed as a probe to detect multi-branching type sugar chain structures, it has become possible to examine various morbid tissue samples. As a result, it has been found that some types of cancer cells, particularly cancer cells with a high metastatic ability, tend to be stained well with L-PHA. Thus, researchers have become aware of the correlation between the branching structure of sugar chains and the metastatic ability of cancer cells. Human chorionic gonadotropin (hCG) is a glycoprotein hormone vigorously biosynthesized in villus tissues at an early stage of pregnancy. Since a considerable amount of hCG is discharged into urine, hCG is clinically utilized as an indicator of pregnancy. Asn-linked sugar chains composed mainly of mono- and biantennary complex type chains are characteristic of hCG. It is reported that, as cancer increases in malignancy from trophoblastoma to invasive mole and from invasive mole to choriocarcinoma, 2,4,2 type tri-antennary sugar chains and abnormal biantennary sugar chains (formed through the action of GnT-IV on normal biantennary and mono-antennary sugar chains, respectively) appear in the sugar chains of hCG [Katsuko Yamashita, Protein, Nucleic Acid and Enzyme (1992), 37, 1880-1888]. As a cause of this phenomenon, it is suggested that the activity of GnT-IV increases as the malignancy of choriocarcinoma progresses.

[0010] γ-Glutamyltranspeptidase (γ-GTP) is a glycoprotein occurring in particular abundance in liver. Since the serum γ-GTP level increases drastically when there is a liver disease, this level is used as a clinical indicator of liver disease. Further, Yamashita et al. [Yamashita, K., Totani, K., Iwaki, Y., Takamisawa, I., Takeishi, N., Higashi, T., Sakamoto, Y. and Kobata, A., J. Biochem., (1989), 105, 728-735] have found that, as a result of canceration of cells, the sugar chain structure of γ-GTP changes abnormally in its branching structure in a manner similar to that seen in abnormal hCG; thus, they have reported the correlation between canceration and the activation of GnT-IV. The Asn-linked sugar chains of γ-GTP derived from healthy human liver cells are composed mainly of the biantennary complex type sugar chain with small amounts of tri-antennary and tetra-antennary sugar chains mixed therein. In contrast, a remarkable increase in the degree of branching was observed in the Asn-linked sugar chains of γ-GTP derived from human hepatoma cells. At the same time, though in small amounts, high mannose type sugar chains and abnormal biantennary sugar chains (both of which were not observed in γ-GTP from normal cells) appeared. As a cause of these changes in sugar chain structure, it is suggested that N-acetylglucosaminyltransferase IV (GnT-IV) and V (GnT-V) may be activated in relation to canceration of liver cells (Yamashita et al., supra).

[0011] It is also reported that the sugar chain branching structure of a glycoprotein in cells is greatly changed by viral infection (Yamashita et al., supra). BHK cells have sugar chain structures with branching up to the tetraantennary type. When BHK cells are transformed with polyomavirus, biantennary type sugar chains decrease in the glycoprotein sugar chains produced by the cells, while tetraantennary type sugar chains and N-acetyllactosamine repeat structures increase; as a whole, a remarkable increase in the number of branches was recognized [Takasaki S., Ikehira, H. and Kobata A., Biochem. Biophys. Res. Commun., (1980), 90, (3), 735-742]. As a cause of the above change, activation of GnT-IV, GnT-V and i-GnT may be considered.

3. Enzymes relating to the Sugar Chain Branching Structures of Glycoproteins 

[0012] The complex type sugar chain, which is a glycoprotein sugar chain structure characteristic of animals, has a complicated branching structure in which N-acetylglucosamine (GlcNAc) residues are attached to the common core structure in various manners (Kiyoshi Furukawa, supra) (Fig. 1). Since this branching structure is closely related to the in vivo and in vitro stability, localization, biological activity and pharmacological properties of glycoproteins (Makoto Takeuchi, supra), the process of biosynthesis of the branching structure has been investigated in detail. By using ingeniously designed substrates, H. Schachter et al. discriminated the various enzyme activities in hen oviduct and thereby predicted the presence of GlcNAc branch forming enzymes from GnT-I to GnT-VI (a group of GlcNAc glycosyltransferases; Fig. 3) [Gleeson, P. A. and Schachter, H., J. Biol. Chem., (1983), 258, 6162-6173]. Thereafter, GnT-I [Kumar, R., Yang, J., Larsen, R. D. and Stanley, P., Proc. Natl. Acad. Sci. USA, (1990), 87, 9948-9952; Sarkar, M., Hull, E., Nishikawa, Y., Simpson, R. J., Moritz, R. L., Dunn, R. and Schachter, H., Proc. Natl. Acad. Sci. USA, (1991), 88, 234-238], GnT-II [D'Agostaro, G. A., Zingoni, A., Moritz, R. L., Simpson, R. J., Schachter, H. and Bendiak, B., J. Biol. Chem., (1995), 270, 15211-21], GnT-III [Nishikawa, A., Ihara, Y., Hatakeyama, M., Kangawa, K. and Taniguchi, N., J. Biol. Chem., (1992), 267, 18199-18204] and GnT-V [Shorebah, M. G., Hindsgaul, O. and Pierce, M., J. Biol. Chem., (1992), 267, 2920-2927; Gu, J., Nishikawa, A., Turuoka, N., Ono, M., Yamaguchi, N., Kangawa, K. and Taniguchi, N., J. Biochem., (1993), 113, 614-619] were successively purified, and the genes thereof were cloned. However, with these known GlcNAc transferases alone, it is impossible to form the main sugar chain (tetraantennary type; see the formula below) found in α1-acid glycoprotein, known as a representative human blood glycoprotein [Yoshima, K., Tsuji, T., Irimura, T. and Osawa, T., J. Biol. Chem., (1984), 256, 10834-10840], and in erythropoietin [Takeuchi, M., Takasaki, S., Shimada, M. and Kobata, A., J. Biol. Chem., (1990), 265, 12127-12130]. Therefore, an N-acetylglucosaminyltransferase having the substrate specificity and reaction specificity expected of GnT-IV has been searched for as a missing link.

[0013] In addition to those mentioned above, the following N-acetylglucosaminyltransferases have been purified or the genes thereof have been cloned: a transferase acting on mucin-type sugar chains [Bierhuizen, M. F., Maemura, K. and Fukuda, M., J. Biol. Chem., (1994), 269, 4473-4479], a transferase acting on glycolipids, and a transferase forming the sugar chain epitope known as the I·i antigenic structure [Kawashima, H., Yamamoto, K., Osawa, T. and Irimura, T., J. Biol. Chem., (1993), 268, 27118-27126; Bierhuizen, M. F., Mattei, M. G. and Fukuda, M., Genes Dev., (1993), 7, 468-478]. However, the substrate specificity of these transferases and the mode of binding of the GlcNAc group transferred by these transferases are different from those of GnT-IV. None of these transferases yields products resembling GnT-IV products.

Disclosure of the Invention



[0014] It is an object of the present invention to provide an enzyme having β1→4 N-acetylglucosaminyltransferase (hereinafter referred to as "GnT-IV") activity; a gene encoding the enzyme; a recombinant DNA comprising the gene; a cell containing the recombinant DNA; a method for producing an enzyme protein having GnT-IV activity comprising culturing the cell in a medium; and a saccharide in which the sugar chains are modified with GnT-IV.

[0015] To solve the above problems, the present inventors have conducted intensive and extensive research. As a result, the inventors have isolated and purified a GnT-IV enzyme protein from bovine small intestine, and characterized the biochemical properties of the protein. Then, the inventors have succeeded in cloning a gene coding for bovine GnT-IVa from a cDNA library and mRNA from the small intestine based on a partial amino acid sequence of the above enzyme protein. Further, based on the bovine GnT-IVa gene, the inventors have succeeded in cloning two genes coding for human GnT-IVa and human GnT-IVb, from cDNA libraries and mRNAs from human liver and human lung, respectively. The present invention has been completed by confirming that the products of these genes exhibit GnT-IV activity.

[0016] The first invention of the present application relates to a GnT-IV having an activity to produce a saccharide having a partial structure represented by the formula below:

GlcNAcβ1→4(GlcNAcβ1→2)Manα1→3Manβ1→4GlcNAc-

using UDP-GlcNAc as a sugar donor and a saccharide having a partial structure represented by the formula below as a sugar receptor:

GlcNAcβ1→2Manα1→3Manβ1→4GlcNAc-


[0017] The second invention relates to a GnT-IV consisting of the amino acid sequence shown in SEQ ID NO: 18 or the amino acid sequence shown in SEQ ID NO: 18 which has addition, deletion or substitution of one or more amino acid residues and yet which produces GnT-IV activity; a GnT-IV consisting of the amino acid sequence shown in SEQ ID NO: 24 or the amino acid sequence shown in SEQ ID NO: 24 which has addition, deletion or substitution of one or more amino acid residues and yet which produces GnT-IV activity; and a GnT-IV consisting of the amino acid sequence shown in SEQ ID NO: 37 or the amino acid sequence shown in SEQ ID NO: 37 which has addition, deletion or substitution of one or more amino acid residues and yet which produces GnT-IV activity.

[0018] The third invention relates to a GnT-IV gene coding for a GnT-IV consisting of the amino acid sequence shown in SEQ ID NO: 18 or the amino acid sequence shown in SEQ ID NO: 18 which has addition, deletion or substitution of one or more amino acid residues and yet which produces GnT-IV activity; a GnT-IV gene coding for a GnT-IV consisting of the amino acid sequence shown in SEQ ID NO: 24 or the amino acid sequence shown in SEQ ID NO: 24 which has addition, deletion or substitution of one or more amino acid residues and yet which produces GnT-IV activity; a GnT-IV gene coding for a GnT-IV consisting of the amino acid sequence shown in SEQ ID NO: 37 or the amino acid sequence shown in SEQ ID NO: 37 which has addition, deletion or substitution of one or more amino acid residues and yet which produces GnT-IV activity; a GnT-IV gene consisting of the nucleotide sequence shown in SEQ ID NO: 17; a GnT-IV gene consisting of the nucleotide sequence shown in SEQ ID NO: 23; and a GnT-IV gene consisting of the nucleotide sequence shown in SEQ ID NO: 36.

[0019] The fourth invention relates to a recombinant DNA obtainable by inserting any of the above GnT-IV genes into a vector DNA; and a chromosomal fragment comprising a part or all of any one of the above GnT-IV genes.

[0020] The fifth invention relates to a host cell carrying the above recombinant DNA; and a host cell into which the above chromosomal fragment is artificially introduced.

[0021] The sixth invention relates to a method for producing a GnT-IV comprising culturing the above host cell in a medium and recovering the GnT-IV from the resultant culture; and a method for producing a GnT-IV comprising recovering the GnT-IV enzyme from the secretions, body fluids or homogenate originating from the above host cell.
[0022] The seventh invention relates to a method for purifying the GnT-IV from biological samples.
[0023] The eighth invention relates to a saccharide of which the sugar chain structure is modified with the GnT-IV.
[0024] Hereinbelow, the present invention will be described in detail. 
[0025] The GnT-IV gene of the invention can be isolated as described below. 


Isolation of Bovine GnT-IVa Gene 

[0026] First, a microsome fraction from bovine small intestine solubilized with a detergent is subjected to a series of purification procedures using anion exchange chromatography, copper chelate chromatography, two-step affinity chromatography using a substrate analogue, and gel filtration to thereby obtain a purified sample of GnT-IV enzyme. The resultant purified sample is subjected to SDS-PAGE and then transferred onto a PVDF membrane. The transferred protein, as it is or after limited hydrolysis, is analyzed with a gas phase amino acid sequencer to obtain a partial amino acid sequence for the GnT-IV enzyme.

[0027] Subsequently, an RT-PCR is performed on the RNA extracted from the animal cells (i.e., bovine small intestine) as a template using primers designed based on the partial amino acid sequences determined above. Further, using a fragment obtained by the RT-PCR as a probe, the GnT-IV gene of interest is screened from a cDNA library from the above-mentioned tissue by plaque hybridization. A cDNA fragment contained in the resultant positive plaque is cut out and subcloned into a vector such as pUC19, followed by analysis of the nucleotide sequence thereof. If the full length of the gene coding for the protein of interest is not contained in the fragment, plaque hybridization is performed again using a part of the subcloned cDNA fragment as a probe. Alternatively, terminal portions of the cDNA of interest are obtained by RACE or the like based on the information on the nucleotide sequence obtained above. The thus obtained GnT-IV gene (which is named GnT-IVa afterward) is subjected to analysis of its entire nucleotide sequence. Subsequently, the amino acid sequence is deduced by translating the gene having the above-mentioned nucleotide sequence. This amino acid sequence is as shown in SEQ ID NO: 18.
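
The translation step described above can be sketched as follows. This is a minimal Python illustration, assuming the standard genetic code and a reading frame that starts at the first nucleotide; the function name `translate_orf` and the toy input are hypothetical and are not taken from SEQ ID NO: 17 or 18.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_orf(cdna: str) -> str:
    """Translate a cDNA (or mRNA) open reading frame into one-letter amino acids.

    Translation starts at the first base and stops at the first stop codon.
    Ambiguous bases (e.g. 'N') are not handled and raise KeyError.
    """
    seq = cdna.upper().replace("U", "T")
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the reading frame
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # Toy sequence for illustration only.
    print(translate_orf("ATGGCCTAA"))
```

In practice the reading frame would first be located from the sequencing data (for example from the start codon of the longest open reading frame) before a function of this kind is applied.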


Isolation of Human GnT-IVa and GnT-IVb Genes 

[0028] Human GnT-IVa and GnT-IVb genes can be obtained by performing an RT-PCR using RNA extracted from a human tissue (liver or lung) and based on the information on the nucleotide sequence of the bovine GnT-IVa gene obtained above, followed by screening of a cDNA library from the above tissue. The resultant human GnT-IVa and GnT-IVb genes are subjected to analysis of their entire nucleotide sequences. Subsequently, the amino acid sequences are deduced by translating these genes. These amino acid sequences are as shown in SEQ ID NOS: 24 and 37.

[0029] In order to obtain a DNA coding for the amino acid sequence shown in SEQ ID NO: 18, 24 or 37 having addition, deletion or substitution of one or more amino acid residues, a number of methods may be used. Examples include a method of treating DNA with a mutagen to induce a point mutation or a deletion mutation; a method comprising cleaving DNA selectively, removing or adding a selected nucleotide and then ligating the DNA; site-specific mutagenesis; and the like.

[0030] The GnT-IV protein of the invention can be produced by preparing a recombinant vector into which a DNA coding for the GnT-IV of the invention obtained by the method described above is inserted downstream of a promoter, introducing the vector into a host cell and culturing the resultant cell. The vector DNA used for this purpose may be either plasmid DNA or bacteriophage DNA. For example, the pSVL vector (Pharmacia, Sweden) shown in an Example described later may be used. As the host cell into which the resultant recombinant DNA is introduced, any cell that is conventionally used in recombinant DNA techniques may be used, for example, a prokaryotic cell, an animal cell, a yeast, a fungus or an insect cell. Specific examples include Escherichia coli as a prokaryotic cell and CHO cells from Chinese hamster or COS cells from monkey as an animal cell.

[0031] The transformation of the host cell described above is performed by conventional methods for each host. For example, if the host is E. coli, a vector comprising the recombinant DNA is introduced by the heat shock method or electroporation into competent cells prepared by the calcium method or the like. If the host is yeast, a vector comprising the recombinant DNA is introduced by the heat shock method or electroporation into competent cells prepared by the lithium method or the like. If the host is an animal cell, a vector comprising the recombinant DNA is introduced into the cell at the growth phase or the like by the calcium phosphate method, lipofection or electroporation.
[0032] By culturing the thus obtained transformant in a medium, the GnT-IV protein is produced.

[0033] In the cultivation of a transformant, any medium may be used as long as the host is viable in it. For example, LB medium or the like may be used if the host is E. coli. If the host is yeast, YPD medium or the like may be used. If the host is an animal cell, Dulbecco's medium supplemented with an animal serum or the like may be used. The cultivation is performed under conditions conventionally used for the host. For example, if the host is E. coli, cells are cultured at about 30-37°C for about 3-24 hours with, if necessary, aeration and/or agitation. If the host is yeast, cells are cultured at about 25-37°C for about 12 hours to 2 weeks with, if necessary, aeration and/or agitation. If the host is an animal cell, cultivation is performed at about 32-37°C under 5% CO2 and 100% humidity for about 24 hours to 2 weeks with, if necessary, change of the aeration conditions and/or agitation.

[0034] After the cultivation, the cultured microorganism or cells are disrupted using a homogenizer, French press, sonication, lysozyme and/or freeze-thawing to thereby elute the GnT-IV protein outside the microorganism or cells. Then, the protein can be obtained from soluble fractions. If the protein of interest is contained in insoluble fractions, the insoluble fractions are collected by centrifugation after disruption of the microorganism or cells. Then, the protein may be solubilized with a buffer containing guanidine hydrochloride or the like for recovery. Alternatively, the cultured microorganism or cells may be disrupted directly with a buffer containing a protein denaturing agent such as guanidine hydrochloride to thereby elute the protein of interest outside the microorganism or cells.

[0035] Purification of the GnT-IV protein from the above supernatant may be performed by the method described in Example 1. Alternatively, this purification may be performed by appropriately combining conventional separation/purification methods. These conventional separation/purification methods include, but are not limited to, centrifugation, salting out, solvent precipitation, dialysis, ultrafiltration, partition chromatography, gel filtration, capillary electrophoresis, TLC, ion exchange chromatography, metal chelate chromatography, affinity chromatography, reversed phase chromatography and isoelectric focusing.

[0036] The biochemical properties of the GnT-IV enzyme protein obtained from bovine small intestine as described 
above are as follows. 






(1) Action 

[0037] This enzyme protein produces a saccharide having a partial structure represented by the formula below:

GlcNAcβ1→4(GlcNAcβ1→2)Manα1→3Manβ1→4GlcNAc-

using UDP-GlcNAc as a sugar donor and a saccharide having a partial structure represented by the formula below as a sugar receptor:

GlcNAcβ1→2Manα1→3Manβ1→4GlcNAc-


[0038] The saccharide as a sugar receptor means an oligosaccharide, polysaccharide, glycoconjugate (glycopeptide, glycoprotein, glycolipid or proteoglycan) or a derivative thereof.

(2) Substrate Specificity 

[0039] When the sugar receptor is an oligosaccharide (for the structures of oligosaccharides, see Fig. 4), the enzyme protein exhibits reactivities of 0% toward core type oligosaccharides, 54% toward GnT-I product type oligosaccharides and 164% toward GnT-V product type oligosaccharides, wherein the reactivity of the enzyme protein toward GnT-II product type oligosaccharides is regarded as 100%.

[0040] The enzyme protein exhibits a reactivity of 46% toward a structure of GnT-II product type oligosaccharides in which fucose is attached via α1→6 linkage to the GlcNAc at the reducing terminus.

[0041] The enzyme protein exhibits a reactivity of 0% toward a structure of GnT-II product type oligosaccharides in which the GlcNAc on the α1→3 mannose is lacking.

[0042] The enzyme protein exhibits a reactivity of 16% toward a structure of GnT-II product type oligosaccharides in which galactose is attached via β1→4 linkage to the GlcNAc on the α1→6 mannose, and a reactivity of 0% toward a structure of GnT-II product type oligosaccharides in which galactose is attached via β1→4 linkage to the GlcNAc on the α1→3 mannose.

[0043] The enzyme protein exhibits a reactivity of 0% toward a structure of GnT-II product type oligosaccharides in which GlcNAc is attached via β1→4 linkage to the β1→4 mannose.

(3) Molecular Weight 

[0044] About 66 K as determined by SDS-PAGE (under non-reducing conditions). About 60 K after treatment with peptide N-glycosidase F. Since a shift of band is observed when peptide N-glycanase is used, the enzyme protein is thought to be a glycoprotein.

[0045] The apparent molecular weight as determined by gel filtration in the presence of Triton X-100 is 77 K. Thus, it is thought that GnT-IV does not have a subunit structure and functions as a monomer.

[0046] The protein moiety of this enzyme deduced from the nucleotide sequence thereof consists of 535 amino acid residues and has a molecular weight of 61614.

(4) Optimum pH 

[0047] The optimum pH for reaction is about 7.5. More than 50% of the maximum activity is observed in the range from pH 6.5 to 8.0.

(5) Inhibition, Activation and Stabilization 

(i) Inhibition 

[0048] The activity of this enzyme is inhibited by addition of 20 mM EDTA.

[0049] This enzyme is inhibited by UDP derivatives. The intensity of inhibition is in the following order: UDP >> UDP-Glc > UDP-GalNAc >> 2'-deoxy-UDP > UDP-hexanolamine >> UDP-Gal > UTP > UDP-glucuronic acid > UMP.

[0050] Uridine, TDP and CDP do not have an inhibitory effect.

(ii) Activation 

[0051] A divalent cation is essential for expression of the activity. Among divalent cations, Mn2+ shows the greatest effect. At a concentration of 7.5 mM, the respective effects of Co2+ and Mg2+ are about 70% of that of Mn2+, and the effect of Ca2+ is about 10% of that of Mn2+. The effect of Mn2+ is greatest in the range from 5 to 20 mM.

(iii) Stabilization 

[0052] BSA and glycerol have a stabilizing effect on the enzyme.

(6) Kinetic Parameters 

[0053] When the saccharide as a receptor is an oligosaccharide (for the structures of oligosaccharides, see Fig. 4):

(i) under assay conditions in which the enzyme is reacted in 50 µl of 125 mM MOPS buffer (pH 7.3) containing 0.8 mM receptor substrate, 20 mM UDP-GlcNAc, 7.5 mM MnCl2, 200 mM GlcNAc, 0.5% (w/v) Triton X-100, 10% glycerol and 1% BSA at 37°C for 4 hours:

Km and Vmax values toward GnT-II product type oligosaccharide are 0.73 mM and 3.23 µM/min, respectively.
Km and Vmax values toward GnT-V product type oligosaccharide are 0.13 mM and 1.75 µM/min, respectively.
When GnT-II product type oligosaccharide is the receptor substrate, the Km value toward UDP-GlcNAc is 0.22 mM.

(ii) under assay conditions in which the enzyme is reacted in 125 mM MOPS buffer (pH 7.3) containing 120 mM UDP-GlcNAc, 7.5 mM MnCl2, 0.5% (w/v) Triton X-100, 10% glycerol and 1% BSA at 37°C for 4 hours:

Km and Vmax values toward GnT-II product type oligosaccharide are 0.59 mM and 0.74 mM/min/mg, respectively.
Km and Vmax values toward GnT-V product type oligosaccharide are 0.14 mM and 0.47 mM/min/mg, respectively.
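
The Km and Vmax values listed under condition (i) can be placed into the Michaelis-Menten equation, v = Vmax [S] / (Km + [S]), to estimate the reaction rate at a given acceptor concentration. The following is a minimal sketch that assumes simple Michaelis-Menten behaviour with the sugar donor in excess, which the specification does not state explicitly; the function and variable names are illustrative only.

```python
def mm_rate(s_mM, km_mM, vmax):
    """Michaelis-Menten rate: v = Vmax * [S] / (Km + [S])."""
    return vmax * s_mM / (km_mM + s_mM)


# Km (mM) and Vmax (uM/min) as listed for condition (i).
PARAMS = {
    "GnT-II product type": (0.73, 3.23),
    "GnT-V product type": (0.13, 1.75),
}


def rates_at(s_mM):
    """Rate for each acceptor type at the same acceptor concentration."""
    return {name: mm_rate(s_mM, km, vmax) for name, (km, vmax) in PARAMS.items()}


if __name__ == "__main__":
    # 0.8 mM is the acceptor concentration used in the standard assay.
    for name, v in rates_at(0.8).items():
        print(f"{name}: {v:.2f} uM/min at 0.8 mM acceptor")
```

At an acceptor concentration equal to Km the function returns half of Vmax, which is a quick sanity check on the parameters.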

(7) GnT-IV Family

[0054] The homology between bovine GnT-IVa and human GnT-IVa is 91% at the nucleic acid level and 96% at the amino acid level.

[0055] All of the partial amino acid structures contained in the purified GnT-IV from bovine small intestine are encoded in the bovine GnT-IVa gene.

[0056] Human GnT-IVb and human GnT-IVa have 63% homology at the nucleic acid level and 62% homology at the amino acid level. However, they are entirely different in the C-terminal and N-terminal regions.

[0057] From the biochemical properties described above, the GnT-IV of the invention has been recognized as a novel enzyme in that it is able to perform the following reaction, which conventional enzymes cannot perform:



-Manα1→6(GlcNAcβ1→2Manα1→3)Manβ1- + UDP-GlcNAc → -Manα1→6[GlcNAcβ1→4(GlcNAcβ1→2)Manα1→3]Manβ1- + UDP

Brief Description of the Drawings

[0058]



Fig. 1 shows the biosynthetic pathway of Asn-linked sugar chains. 

Fig. 2 shows variations of the Asn-linked sugar chain (revised from Fig. 1 in Makoto Takeuchi, Wako Purechemical Newsletter 64, 18-19, 1996).

a. Mannan type: a sugar chain structure characteristic of fungi such as yeasts and molds.
b. Xylo-high-mannose type: a structure characteristic of plants, mollusks and insects.
c. High mannose type: a structure commonly seen in plants, insects and animals.
d. Hybrid type: a structure commonly seen in insects and animals.
e. Complex type: a structure characteristic of animals.
f. Prokaryotic cells: have no system for biosynthesis of Asn-linked sugar chains.

The portion boxed with dotted lines represents the common core sugar chain.
Fig. 3 shows the positions of GlcNAc transfer by various GlcNAc transferases (GlcNAc glycosyltransferases). 
Fig. 4 shows the designations and structures of oligosaccharides. 
Fig. 5 shows a high performance liquid chromatogram for GnT-IV reaction products. 
Fig. 6 shows the results of analysis of GnT-IV by Q-Sepharose FF chromatography.
Fig. 7 shows the results of analysis of GnT-IV by copper chelate Sepharose FF chromatography.
Fig. 8 shows the results of analysis of GnT-IV by UDP-Hexanolamine Agarose affinity chromatography (I).
Fig. 9 shows the results of analysis of GnT-IV by UDP-Hexanolamine Agarose affinity chromatography (II).
Fig. 10 shows the results of analysis of GnT-IV by Superdex 200 gel chromatography.
Fig. 11 is a photograph showing the results of SDS-PAGE (SDS polyacrylamide gel electrophoresis) of purified GnT-IV.
Fig. 12 shows the results of native gel electrophoresis of purified GnT-IV and the activity thereof.
Fig. 13 shows Smith degradation profiles of GnT-IV, -V and -VI product type oligosaccharides.
Fig. 14 shows the results of 1H-NMR (30°C) of the GnT-IV reaction product.
Fig. 15 shows the optimum pH for GnT-IV.
Fig. 16 shows the optimum Mn2+ concentration for GnT-IV.
Fig. 17 shows the results of analysis by SDS-PAGE and fluorochromatography of glycoproteins which are the reaction products of GnT-IV.

Lanes 1 and 2: 7.6 µg of asialo agalacto human transferrin
Lane 3: 7.6 µg of asialo human transferrin
Lanes 4 and 5: 2.8 µg of asialo agalacto, CHO cell-derived recombinant human erythropoietin
Lanes 6 and 7: 1.3 µg of asialo agalacto fetuin

Lanes 1, 4 and 6 represent mock experiments in which the reaction was performed without GnT-IV. M represents molecular markers (Bio-Rad). PM represents pre-stained molecular markers (Bio-Rad, USA).

GnT-IV reaction conditions: To 10 µl of a solution containing 0.702 nmol/hr of GnT-IV, a substrate glycoprotein equivalent to 1.6 nmol of biantennary type sugar chains (for fetuin alone, the sugar chain content was 1.6 nmol) and 450 nCi of UDP-[14C]GlcNAc, an equal volume of an assay mixture (250 mM MOPS buffer, pH 7.3, 400 mM GlcNAc, 20% glycerol, 1.0% (w/v) Triton X-100, 15 mM MnCl2, 1 mM UDP-GlcNAc) was added to obtain a reaction solution, which was incubated at 37°C for 20 hours. One tenth of the resultant solution was analyzed by SDS-PAGE and fluorography.

For SDS-PAGE, a 10-20% gradient gel (Daiichi Kagaku) was used. For fluorography, Amplify (Amersham) was used and the samples were exposed to X-ray film for 20 hours. The visualization of biantennary sugar chains in the glycoprotein was performed using ConA-HRP and POD-Immunostain Kit (Wako Purechemical) after dot-blotting onto a PVDF membrane.

Fig. 18 shows the open reading frame of human GnT-IVa and the region contained in the pCore-His expression vector.
Fig. 19 shows the results of isoelectric focusing and Western analysis of erythropoietins produced by individual cell clones. Using two erythropoietin-producing strains and the same strains into which bovine and human GnT-IVa genes were introduced, respectively, the erythropoietin secreted by each strain was analyzed by isoelectric focusing and Western blotting using anti-erythropoietin antibody. On the left side, the positions of pI markers are shown.

Best Modes for Carrying Out the Invention

[0059] Hereinbelow, the present invention will be described in more detail with reference to the following Examples. 
However, the present invention is not limited to these Examples. 

[REFERENCE EXAMPLE 1]

(1) Reagents Used in the Examples 

[0060] Unless indicated otherwise, the reagents used were the highest grade products manufactured by Wako Pure Chemical Industries, Ltd.

(i) Pyridylaminated Oligosaccharides 

[0061] Each of the pyridylaminated oligosaccharides used was obtained as described below. First, pyridylaminated oligosaccharides were prepared from human transferrin (apo type; Sigma, USA) according to the method of Tokugawa et al. [Bierhuizen, M.F., Mattei, M.G. and Fukuda, M. (1993) Genes Dev., 7, 468-478]. The resultant material was treated with one or a combination of the following enzymes: Arthrobacter ureafaciens-derived sialidase (Nacalai Tesque), Aspergillus sp.-derived β-galactosidase (Toyobo), Jack bean-derived β-N-acetylhexosaminidase (Seikagaku Corp.), the GnT-V active fraction in CHO-K1 cell extract (supernatant obtained by sonication of CHO-K1 cells in 2 volumes of 5 mM Tris-HCl buffer, pH 7.5, containing 2 mM MgCl2 and 1 mM PMSF, and then centrifugation at 900 x g for 10 min), and the GnT-V active fraction in the solubilized fraction from bovine small intestine homogenate (for the method of preparation, see "Preparation of the Microsome Fraction" and "Solubilization" in Example 1). Some of the pyridylaminated oligosaccharides were prepared by treating PA-Sugar Chain 021 and 022 (Takara Shuzo) with the above enzymes. In both cases, the oligosaccharides prepared were purified by reversed phase chromatography using an ODS column (10 x 250 mm; Vydac, USA) before use.

(ii) Glycoprotein Substrates 

[0062] Bovine fetuin (Sigma, USA) and CHO cell-derived recombinant human erythropoietin (Kirin Brewery) were subjected to the following pretreatment to purify them into relatively uniform glycoforms. Briefly, 40-100 mg of the glycoprotein was applied to a ConA-Sepharose column (5 ml; Pharmacia, Sweden) equilibrated with 10 mM Tris-HCl buffer, pH 7.4, containing 1 mM MgCl2, 1 mM CaCl2 and 0.15 M NaCl to thereby obtain a glycoform with a low biantennary sugar chain content as the non-adsorbed fraction. Thereafter, the column was eluted with the above buffer containing 1.0 M α-methyl mannoside (Nacalai Tesque) to thereby obtain the adsorbed fraction, a glycoform with a high biantennary sugar chain content. Thus, fetuin with a low biantennary sugar chain content and erythropoietin with a high biantennary sugar chain content were obtained. With respect to human transferrin, there is no need to purify this glycoprotein since almost all sugar chains thereof are biantennary.

[0063] The thus obtained fetuin and human transferrin were individually reacted with 1 U of sialidase and 0 or 107 U of β-galactosidase in 1 ml of 0.4 M sodium acetate buffer, pH 5.0, containing 4 mM MgCl2 at 37°C for 16 hours to thereby obtain asialo or asialo agalacto glycoproteins. The erythropoietin with a high biantennary sugar chain content was reacted with 0.5 U of sialidase and 5 U of β-galactosidase in the same manner as described above to obtain an asialo agalacto glycoprotein.

[0064] Each of the thus obtained glycoprotein substrates was dialyzed against 50 mM ammonium acetate buffer, pH 7.3. Then, the amount of protein was determined with BCA protein assay (Pierce, USA) using BSA (bovine serum albumin) as a standard. Further, the protein was analyzed by SDS-PAGE (sodium dodecyl sulfate polyacrylamide gel electrophoresis). The thus prepared glycoproteins were used in the Examples.

(iii) RT-PCR (Reverse Transcription-Polymerase Chain Reaction)

[0065] For RT-PCR, Access RT-PCR System (Promega, USA) was used. For the amplification of a fragment of a gene of interest, Pfu polymerase (Stratagene, USA) was used.

(2) Equipment Used in the Examples

(i) Gene Sequencing

[0066] ABI PRISM 377 DNA Sequencer (Perkin-Elmer, USA) was used.

[REFERENCE EXAMPLE 2] Specific Assay for GnT-IV Activity 

[0067] Generally, there are two methods for assaying GnT-IV activity: a method in which the transfer of radiolabeled GlcNAc to an oligosaccharide substrate is examined, and a method in which the transfer of GlcNAc to a labeled oligosaccharide substrate is fractionally analyzed by HPLC or the like. Taniguchi et al. developed a method in which GnT-III, -IV and -V activities are simultaneously determined using the GnT-II product type oligosaccharide as a receptor [Nishikawa, A., Fujii, S., Sugiyama, T. and Taniguchi, N. (1988) Anal. Biochem., 170, 349-354]. However, this assay method as it stands was inappropriate for assays during purification of GnT-IV because the relative activity of GnT-IV is much lower than those of GnT-III and -V.

[0068] The present inventors therefore developed a method for determining GnT-IV activity quantitatively and sensitively by increasing the amount of the acceptor pyridylaminated oligosaccharide 10-fold compared to the amount used in the previous assay system [Tokugawa, K., Oguri, S. and Takeuchi, M. (1996) Glycoconjugate J., 13, 53-56]. Generally, it is very difficult to prepare such a large amount of acceptor oligosaccharides. However, according to the method of Tokugawa et al. [Tokugawa, K., Oguri, S. and Takeuchi, M. (1996) Glycoconjugate J., 13, 53-56], such oligosaccharides are readily prepared.

[0069] In the Examples of the present invention, GnT-IV activity was assayed as described below.

[0070] The enzyme was reacted in 125 mM MOPS [3-(N-morpholino)propanesulfonic acid] buffer, pH 7.3, containing 0.8 mM pyridylaminated oligosaccharide substrate (GnT-II product type oligosaccharide substrate), 20 mM UDP-GlcNAc, 7.5 mM MnCl2, 200 mM GlcNAc, 0.5% (w/v) Triton X-100, 10% glycerol and 1% BSA at 37°C for 4 hours. Then, the reaction was terminated by boiling the solution for 2 minutes. After removal of solids with a 0.45 µm filter, 5 µl of the filtrate was analyzed with an ODS-80TM column (4.6 x 150 mm; TOSO) (Fig. 5) at 50°C with 50 mM ammonium acetate buffer, pH 4.0, containing 0.15% (w/v) n-butanol at a flow rate of 1.2 ml/min. The fluorescence of pyridylamino groups was detected using excitation at 320 nm and emission at 400 nm.
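
The buffer composition of the standard assay can be cross-checked against the two-fold concentrated assay mixture given in the legend to Fig. 17, which is mixed with an equal volume of enzyme solution. The following sketch assumes simple volumetric dilution (final concentration = stock concentration x stock volume / total volume); the dictionary keys and function name are illustrative and not taken from the specification.

```python
def dilute(stock, stock_vol_ul, total_vol_ul):
    """Final concentration of each component after volumetric dilution."""
    factor = stock_vol_ul / total_vol_ul
    return {name: conc * factor for name, conc in stock.items()}


# Two-fold concentrated assay mixture from the Fig. 17 legend.
ASSAY_MIX_2X = {
    "MOPS (mM)": 250.0,
    "GlcNAc (mM)": 400.0,
    "glycerol (%)": 20.0,
    "Triton X-100 (% w/v)": 1.0,
    "MnCl2 (mM)": 15.0,
}

# 10 ul of assay mixture added to 10 ul of enzyme solution.
FINAL = dilute(ASSAY_MIX_2X, stock_vol_ul=10.0, total_vol_ul=20.0)

if __name__ == "__main__":
    for name, conc in FINAL.items():
        print(f"{name}: {conc:g}")
```

The resulting values (125 mM MOPS, 200 mM GlcNAc, 10% glycerol, 0.5% Triton X-100, 7.5 mM MnCl2) match the standard assay conditions stated above; the UDP-GlcNAc concentration differs between the two protocols and is therefore left out of the comparison.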
[EXAMPLE 1] Isolation and Purification of the Enzyme

(1) Screening of a Source of the Enzyme

[0071] A source of the GnT-IV enzyme to be purified was searched for by utilizing the assay method described above. As shown in Table 1, the activity of GnT-IV relative to those of GnT-III and GnT-V was higher in bovine small intestine than in any of the other tissues examined. Thus, bovine small intestine was selected as a starting material for purification.



Table 1

Search for Sources of GnT-IV Enzyme

                                  Specific Activity (pmol/h/mg protein)
Source of the Enzyme              IV        III       V
Cultured cells
  CHO                             10.8      0         1097
  Bowes                           12.0      341       150
  AH66 1)                         2.0       634       30
  Solid AH 1)                     27        116       80
  Yoshida sarcoma 1)              1.3       70        109
Rat organs 1)
  Small intestine                 17        280       68
  Heart                           9.4       11        10
  Spleen                          20        100       21
  Kidney                          1.9       1840      30
  Brain                           3.7       660       38
Human 1)
  Liver                           2.8       8.1       8.2
Bovine organs
  Small intestine                 25        174       [illegible]
  Heart                           N.D.      N.D.      N.D.
  Spleen                          10.9      0.7       10.9
  Colostrum                       N.D.      N.D.      N.D.

N.D.: below detection limit
1): data from Nishikawa, A. et al., BBA 1035, 313-318 (1990)

(2) Purification 

[0072] Unless otherwise indicated, all the operations were performed at 4°C. 
(i) Preparation of the Microsome Fraction 

[0073] Two kilograms of bovine small intestine (obtained from a meat processor) was minced. Then, 4 volumes of an extraction buffer (10 mM Tris-HCl buffer, pH 7.4, containing 0.25 M sucrose, 1 mM phenylmethylsulfonyl fluoride, 1 mM benzamidine hydrochloride, 1 mM dithiothreitol and 10 mg/ml antipain) was added thereto and the mixture was homogenized with a Polytron (Kinematica, Sweden). The resultant homogenate was centrifuged at 900 x g for 10 minutes. Then, the supernatant was centrifuged further at 105,000 x g for 60 minutes to thereby obtain the microsome fraction as a precipitate (Sample 1).

(ii) Solubilization

[0074] Sample 1 was suspended in 3 volumes of a solubilization buffer prepared by adding Triton X-100 to the extraction buffer to give a final concentration of 1%. The supernatant was obtained by centrifugation at 105,000 x g for 60 minutes. The pellet was suspended and centrifuged again, and the supernatant was collected. The first and second extracts were combined (Sample 2).

(iii) Q-Sepharose FF Chromatography

[0075] Sample 2 was applied to a Q-Sepharose FF Column (5 x 30 cm; Pharmacia, Sweden) pre-equilibrated with operation buffer 1 (20 mM Tris-HCl, pH 7.4, containing 1 mM benzamidine hydrochloride, 0.1% Triton X-100 and 20% glycerol) and then eluted by a linear gradient of 0-0.5 M KCl (Fig. 6) (Sample 3).

(iv) Copper Chelate Sepharose FF Chromatography

[0076] Sample 3 was applied to a Copper Chelate Sepharose FF Column (5 x 10 cm; Pharmacia, Sweden) pre-equilibrated with operation buffer 2 (obtainable by adding KCl to operation buffer 1 at a final concentration of 0.15 M). Then, non-adsorbed fractions were washed out with 5 volumes of operation buffer 2. Thereafter, the adsorbate was eluted by a linear gradient of 0.01 M glycine (Fig. 7). The resultant GnT-IV active fraction was pooled and concentrated with a YM30 ultrafiltration membrane (Amicon, USA) (Sample 4).

(v) UDP-Hexanolamine Agarose Affinity Chromatography I

[0077] One half of Sample 4, dialyzed against operation buffer 3 (20 mM Tris-HCl, pH 8.0, containing 0.15 M KCl, 10 mM MnCl2, 0.05% Triton X-100 and 20% glycerol) supplemented with 1 mM benzamidine hydrochloride, was applied to a UDP-Hexanolamine Agarose Affinity Column (1.2 x 4.5 cm; Sigma, USA) pre-equilibrated with operation buffer 3. Then, non-adsorbed fractions were washed out with operation buffer 4 (20 mM Tris-HCl, pH 8.0, containing 10 mM MnCl2, 0.05% Triton X-100 and 20% glycerol). Thereafter, the adsorbate was eluted with operation buffer 4 to which KCl had been added to a final concentration of 1 M (Fig. 8). The GnT-IV active fraction was pooled and dialyzed against operation buffer 5 (having the same composition as that of operation buffer 4 but having a pH of 7.4) (Sample 5).

(vi) UDP-Hexanolamine Agarose Affinity Chromatography II

[0078] Sample 5 was applied to a UDP-Hexanolamine Agarose Affinity Column (1.0 x 6.5 cm; Sigma, USA) pre-equilibrated with operation buffer 5. Then, non-adsorbed fractions were washed out with operation buffer 5. Thereafter, the adsorbate was eluted with operation buffer 5 from which MnCl2 had been omitted (Fig. 9). The resultant GnT-IV active fraction was pooled (Sample 6).

(vii) Superdex 200 Gel Chromatography

[0079] Sample 6 was concentrated with a small Q-Sepharose FF column and applied to a Superdex 200HR5/5 Column (1 x 30 cm; Pharmacia, Sweden) pre-equilibrated with operation buffer 6 (obtained by adding KCl to operation buffer 5 at a final concentration of 0.15 M) (Fig. 10). Operation buffer 6 was applied to the column at a flow rate of 0.25 ml/min to thereby obtain the GnT-IV active fraction (Sample 7).

(viii) The amount of protein, activity and specific activity in each purification step are summarized in Table 2. The final sample was purified 224,000-fold compared to the small intestine homogenate.

[0080]

Table 2

Purification of GnT-IV

Purification step                    Total protein   Total enzyme        Specific activity   Yield   Purification
                                     (mg)            activity (nmol/h)   (nmol/h/mg)         (%)     factor (-fold)
Bovine small intestine homogenate    112,900         49,500              0.44                100     1
Solubilized fraction                 24,100          14,500              0.60                29      1.4
Q-Sepharose                          4,000           7,200               1.80                14      4.1
Cu Chelate Sepharose                 450             3,670               8.10                7.4     18.4
UDP-Hexanolamine I                   0.59            1,950               3,310               3.9     7,510
UDP-Hexanolamine II                  0.035           1,420               40,600              2.9     92,200
Superdex 200                         0.008           790                 98,800              1.6     224,000

Started from 2 kg of bovine small intestine.
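
The derived columns of Table 2 follow from the protein and activity columns: specific activity is total activity divided by total protein, yield is total activity relative to the homogenate, and the purification factor is specific activity relative to the homogenate. The sketch below recomputes them from the tabulated values; the class and function names are illustrative, and small differences from the printed figures are expected because the table values are rounded.

```python
from dataclasses import dataclass


@dataclass
class Step:
    name: str
    protein_mg: float       # total protein (mg)
    activity_nmol_h: float  # total enzyme activity (nmol/h)


STEPS = [
    Step("Bovine small intestine homogenate", 112900, 49500),
    Step("Solubilized fraction", 24100, 14500),
    Step("Q-Sepharose", 4000, 7200),
    Step("Cu Chelate Sepharose", 450, 3670),
    Step("UDP-Hexanolamine I", 0.59, 1950),
    Step("UDP-Hexanolamine II", 0.035, 1420),
    Step("Superdex 200", 0.008, 790),
]


def summarize(steps):
    """Return (name, specific activity, yield %, purification fold) per step."""
    base = steps[0]
    base_sa = base.activity_nmol_h / base.protein_mg
    rows = []
    for s in steps:
        sa = s.activity_nmol_h / s.protein_mg
        yield_pct = 100.0 * s.activity_nmol_h / base.activity_nmol_h
        rows.append((s.name, sa, yield_pct, sa / base_sa))
    return rows


if __name__ == "__main__":
    for name, sa, yield_pct, fold in summarize(STEPS):
        print(f"{name:36s} {sa:12.2f} {yield_pct:6.1f} {fold:12.1f}")
```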



(3) Properties in terms of Enzyme Chemistry and Protein Chemistry 



(i) Purity

[0081] Sample 7 gave a single band at a molecular weight of 60 K in SDS-PAGE (Fig. 11). When Sample 7 was subjected to native-PAGE and the resultant band was cut out from the gel to determine GnT-IV activity, the location of the protein band agreed with the location of the activity (Fig. 12). Furthermore, no GnT-I, -II, -III or -V activity was detected in Sample 7. From these findings, it was concluded that Sample 7 is pure GnT-IV. Taking into account that the apparent molecular weight of this protein was 77 K as determined by gel filtration in the presence of Triton X-100 (Fig. 10), it is thought that GnT-IV does not have a subunit structure and expresses its activity in the form of a monomer. When Sample 7 was treated with Peptide N-Glycosidase F (Boehringer-Mannheim, Germany), an increase in mobility was observed on SDS-PAGE. Thus, it is thought that GnT-IV from bovine small intestine is a glycoprotein having at least Asn-linked sugar chains.

(ii) Reaction Specificity 

[0082] When this enzyme was reacted with the GnT-II product type oligosaccharide represented by the formula below as a substrate:

GlcNAcβ1→2Manα1→6(GlcNAcβ1→2Manα1→3)Manβ1→4GlcNAcβ1→4GlcNAc-PA

under the standard assay conditions, the enzyme yielded a single product (pyridylaminated oligosaccharide 1) as assayed by HPLC.

[0083] This product was collected, and its structure was determined by (i) a combination of Smith degradation and laser TOF-MS (time-of-flight mass spectrometry) and (ii) 1H-NMR. Thus, the reaction specificity of this enzyme was examined. When pyridylaminated oligosaccharide 1 was subjected to Smith degradation according to the method of Kobata and Takasaki [Kobata, A. and Takasaki, S. (1993) in Glycobiology "A Practical Approach" (Fukuda, M. and Kobata, A., eds) 165-185, IRL Press, Oxford, England], its mass number changed from 1599.0 to 795.30 as a result of the first degradation and further changed to 634.68 as a result of the second degradation. This agrees with the reaction pathway shown in Fig. 13. Thus, it was concluded that the reaction product of this enzyme has the following structure:

GlcNAcβ1→2Manα1→6[GlcNAcβ1→4(GlcNAcβ1→2)Manα1→3]Manβ1→4GlcNAcβ1→4GlcNAc-PA

[0084] Further, when pyridylaminated oligosaccharide 1 was subjected to 1H-NMR, a peak at 4.53 ppm which corresponds to the anomeric proton of GlcNAc7 shown in the following formula was detected; its coupling constant J1,2 was 7.9 Hz (Fig. 14). These results indicate that GlcNAc7 is, as shown in the formula below, attached to position 4 of Man4 via β-type linkage, supporting the above structure completely.

GlcNAc-5'(β1→2)Man-4'(α1→6)[GlcNAc-7(β1→4)[GlcNAc-5(β1→2)]Man-4(α1→3)]Man-3(β1→4)GlcNAc-2(β1→4)GlcNAc-1-PA

(iii) Optimum pH 

[0085] The optimum pH of this enzyme is around 7.5 as shown in Fig. 15.

(iv) Requirement of Divalent Cation

[0086] As shown in Table 3, this enzyme is deactivated by the addition of EDTA (ethylenediamine tetra-acetic acid). A divalent cation is essential for its activity. Among divalent cations, Mn2+ shows the greatest effect, followed by Co2+ and Mg2+. Ca2+ and Fe2+ show only a weak effect. The optimum concentration of Mn2+ is around 10 mM as shown in Fig. 16.



Table 3

Divalent Cation Requirement of GnT-IV

Additive     Activity (%)
None         5.6
EDTA         0
MnCl2        100
CoCl2        74.8
MgCl2        72.5
CaCl2        7.2
FeCl2        9.8
CuCl2        0

[0087] GnT-IV activity was determined by adding each of the metal ions (10 mM) to a GnT-IV sample from which metal ions had been removed. GnT-IV activity is represented in per cent in the Table, wherein the activity when 10 mM MnCl2 was added is regarded as 100%.

(v) Inhibition by Sugar Nucleotides 

[0088] As shown in Table 4, UDP inhibited the activity of this enzyme most strongly. The inhibitory effects of UDP-glucose, UDP-GalNAc, 2'-deoxy-UDP and UDP-hexanolamine (Sigma, USA) followed that of UDP in this order. Uridine, UMP, TDP and CDP exhibited little inhibitory effect.

Table 4

Inhibition of GnT-IV by Nucleotides

Additive                       Activity (%)
None                           100
Uridine                        115
UMP                            97.3
UDP                            27.3
UTP                            88.2
TDP                            110
CDP                            112
2'-deoxy-UDP                   67.4
UDP-hexanolamine               73.6
UDP-glucose                    56.6
UDP-galactose                  87.3
UDP-glucuronic acid            92.3
UDP-N-acetylgalactosamine      59.7

[0089] GnT-IV activity when each nucleotide (2 mM) was added in the presence of 0.5 mM UDP-GlcNAc is expressed in per cent in the Table, wherein the activity when nothing was added is regarded as 100%.
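
The order of inhibitory strength stated in the text can be reproduced from Table 4 by converting each residual activity into per cent inhibition (100 minus residual activity) and sorting. The following is a minimal sketch; treating additives with a residual activity of 100% or more as non-inhibitory is an assumption made here for illustration, and the names are not from the specification.

```python
# Residual GnT-IV activity (%) with 2 mM additive, from Table 4.
RESIDUAL_ACTIVITY = {
    "Uridine": 115.0,
    "UMP": 97.3,
    "UDP": 27.3,
    "UTP": 88.2,
    "TDP": 110.0,
    "CDP": 112.0,
    "2'-deoxy-UDP": 67.4,
    "UDP-hexanolamine": 73.6,
    "UDP-glucose": 56.6,
    "UDP-galactose": 87.3,
    "UDP-glucuronic acid": 92.3,
    "UDP-N-acetylgalactosamine": 59.7,
}


def rank_inhibitors(residual):
    """Sort additives by per cent inhibition, strongest first.

    Additives leaving 100% activity or more are treated as non-inhibitory.
    """
    inhibition = {name: 100.0 - act for name, act in residual.items() if act < 100.0}
    return sorted(inhibition.items(), key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    for name, pct in rank_inhibitors(RESIDUAL_ACTIVITY):
        print(f"{name:28s} {pct:5.1f}% inhibition")
```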

(vi) Substrate Specificity 

[0090] As shown in Table 5, this enzyme preferred the GnT-V product type oligosaccharide (E in Table 5) the most as an acceptor. Next to this, the enzyme preferred the GnT-II product type oligosaccharide (D in Table 5).

[0091] When the reactivity of this enzyme toward the GnT-II product type oligosaccharide is regarded as 100%, this enzyme exhibits reactivities of 0% and 54% toward the core type oligosaccharide (A in Table 5) and the GnT-I product type oligosaccharide (C in Table 5), respectively.

[0092] This enzyme exhibits a reactivity of 46% toward a structure of GnT-II product type oligosaccharide in which fucose is attached via α1→6 linkage to the GlcNAc at the reducing terminus (F in Table 5).

[0093] This enzyme exhibits a reactivity of 0% toward a structure of GnT-II product type oligosaccharide in which the GlcNAc on the α1→3 mannose is lacking (B in Table 5).

[0094] This enzyme exhibits a reactivity of 16% toward a structure of GnT-II product type oligosaccharide in which galactose is attached via β1→4 linkage to the GlcNAc on the α1→6 mannose (G in Table 5), and a reactivity of 0% toward a structure of GnT-II product type oligosaccharide in which galactose is attached via β1→4 linkage to the GlcNAc on the α1→3 mannose (H and I in Table 5).

[0095] This enzyme exhibits a reactivity of 0% toward a structure of GnT-II product type oligosaccharide in which GlcNAc is attached via β1→4 linkage to the β1→4 mannose (J in Table 5).

[0096] The substrate specificity of this enzyme as described above agrees almost completely with the substrate specificity of GnT-IV predicted by Schachter et al. [Gleeson, P. A. and Schachter, H. (1983) J. Biol. Chem., 258, 6162-6173]. Thus, it has become clear that this enzyme of the invention is the very GnT-IV that has long been a missing link in the biosynthesis of complex type sugar chains.

Table 5

Receptor Oligosaccharide                                                                          Relative Activity of GlcNAc Transfer (%)
A  Manα1→6(Manα1→3)Manβ1→4GlcNAcβ1→4GlcNAc-PA                                                     0
B  GlcNAcβ1→2Manα1→6(Manα1→3)Manβ1→4GlcNAcβ1→4GlcNAc-PA                                           0
C  Manα1→6(GlcNAcβ1→2Manα1→3)Manβ1→4GlcNAcβ1→4GlcNAc-PA                                           54
D  GlcNAcβ1→2Manα1→6(GlcNAcβ1→2Manα1→3)Manβ1→4GlcNAcβ1→4GlcNAc-PA                                 100
E  GlcNAcβ1→6(GlcNAcβ1→2)Manα1→6(GlcNAcβ1→2Manα1→3)Manβ1→4GlcNAcβ1→4GlcNAc-PA                     164
F  GlcNAcβ1→2Manα1→6(GlcNAcβ1→2Manα1→3)Manβ1→4GlcNAcβ1→4(Fucα1→6)GlcNAc-PA                        46
G  Galβ1→4GlcNAcβ1→2Manα1→6(GlcNAcβ1→2Manα1→3)Manβ1→4GlcNAcβ1→4GlcNAc-PA                          16
H  GlcNAcβ1→2Manα1→6(Galβ1→4GlcNAcβ1→2Manα1→3)Manβ1→4GlcNAcβ1→4GlcNAc-PA                          0
I  Galβ1→4GlcNAcβ1→2Manα1→6(Galβ1→4GlcNAcβ1→2Manα1→3)Manβ1→4GlcNAcβ1→4GlcNAc-PA                   0
J  GlcNAcβ1→2Manα1→6(GlcNAcβ1→4)(GlcNAcβ1→2Manα1→3)Manβ1→4GlcNAcβ1→4GlcNAc-PA                     0



(vii) Kinetic Parameters 

[0097] Under the assay conditions as described in Reference Example 2, the Km and Vmax values of this enzyme toward the GnT-II product type oligosaccharide were 0.73 mM and 3.23 µM/min, respectively, and these values toward the GnT-V product type oligosaccharide were 0.13 mM and 1.75 µM/min, respectively. The Km value toward UDP-GlcNAc was 0.22 mM.

[0098] Among the pyridylaminated oligosaccharides obtained, those represented by the following formulas were found to be novel oligosaccharides:

Manα1→6[GlcNAcβ1→4(GlcNAcβ1→2)Manα1→3]Manβ1→4GlcNAcβ1→4GlcNAc-PA

and

Galβ1→4GlcNAcβ1→2Manα1→6[GlcNAcβ1→4(GlcNAcβ1→2)Manα1→3]Manβ1→4GlcNAcβ1→4GlcNAc-PA



(viii) Action on Glycoproteins 

[0099] In order to demonstrate that GnT-IV can act not only on oligosaccharide substrates but also on oligosaccharide chains on glycoproteins, GnT-IV was reacted with asialo agalacto glycoproteins using UDP-[14C]GlcNAc as a sugar donor. Then, the reaction products were analyzed by SDS-PAGE and fluorography (Panels A and B, Fig. 17). As shown in lanes 2 and 5 of Panel B in Fig. 17, transfer of [14C]GlcNAc to asialo agalacto human transferrin and asialo agalacto, CHO cell-derived recombinant human erythropoietin was observed.

[0100] The human transferrin having the GnT-IV product type sugar chain (of the following formula) obtained by this GnT-IV reaction is a novel substance which does not occur in nature.

GlcNAcβ1→2Manα1→6[GlcNAcβ1→4(GlcNAcβ1→2)Manα1→3]Manβ1→4GlcNAcβ1→4GlcNAc-

GnT-IV product type sugar chain structure

[EXAMPLE 2] Peptide Mapping Analysis 

[0101] About 1 mg of the finally purified enzyme of the invention was electrophoresed on 0.1% SDS-10% polyacrylamide gel according to the method of Laemmli [Laemmli, U.K. Nature (1970) 313, 756-762]. The separated proteins were electroblotted onto a PVDF membrane. The protein fixed on the membrane was S-carboxymethylated and then digested with lysylendopeptidase Achromobacter protease I (AP-I) (Wako Pure Chemical Industries, Ltd.) to obtain an AP-I-digested fragment mixture. The AP-I-digested PVDF membrane was further digested with Asp-N (Takara Shuzo) to obtain an Asp-N-digested fragment mixture. Each of the peptide fragment mixtures was separated by high performance liquid chromatography and subjected to amino acid sequence analysis. As a result, the sequences shown in SEQ ID NOS: 1-14 were obtained.

[EXAMPLE 3] Isolation and Identification of Bovine GnT-IVa cDNA

(1) RT-PCR 

[0102] Based on the amino acid sequences shown in SEQ ID NOS: 7 and 11 obtained in Example 2, oligomer AP-5F
shown in SEQ ID NO: 15 and oligomer DN-9R shown in SEQ ID NO: 16 were synthesized, respectively. RT-PCR
was performed with these primers, using as a template the total RNA extracted from bovine small intestine tissue by
the guanidinium isothiocyanate method. As a result, an apparently specific amplified fragment of about 170 bp was
obtained. This fragment was subcloned.


(2) Screening of a Library 

[0103] A bovine small intestine cDNA library (Clontech, USA) was screened with the above-mentioned RT-PCR product
to obtain four positive plaques. The nucleotide sequences of these clones were determined. The resultant
sequences contained a number of nucleotide sequences coding for some of the partial amino acid sequences (SEQ ID
NOS: 1-14) obtained in Example 2, and also contained a sequence appearing to be a termination codon. Using a fragment
of 150 bp representing the most upstream region of the resultant nucleotide sequence, the library was screened
again to obtain two positive plaques. The nucleotide sequences of these clones were determined. The library was then
screened further in the same manner with a 150 bp probe from the most upstream region; however, no new clones were
obtained.


(3) 5' RACE (Rapid Amplification of the cDNA Ends) 

[0104] Subsequently, 5' RACE was performed in order to obtain a full-length cDNA. Using the sequence of the most
upstream region obtained by the phage screening, the first 5' RACE was performed. However, an initiation codon could
not be found. Then, based on the sequence obtained by the first 5' RACE, the second 5' RACE was performed to
thereby obtain a sequence containing an initiation codon. This sequence was ligated to the previously obtained partial
gene sequence of the phage clone to thereby obtain a gene fragment containing an intact open reading frame (Gene
1). The nucleotide sequence of the thus obtained gene fragment is shown in SEQ ID NO: 17, and the amino acid
sequence deduced therefrom is shown in SEQ ID NO: 18. It was confirmed that this DNA fragment contains all of the
nucleotide sequences coding for the 14 partial amino acid sequences (SEQ ID NOS: 1-14) obtained in Example 2.

[EXAMPLE 4] Construction of an Expression Vector using the Cloned Bovine GnT-IVa Gene and a Method for Produc- 
ing GnT-IVa Enzyme 

(1) Construction of a Vector

[0105] A primer (SEQ ID NO: 19) which introduces an XhoI site into a region upstream of the initiation codon of Gene
1 and another primer (SEQ ID NO: 20) which introduces an XbaI site into a region downstream of the termination codon
of the gene were synthesized. Then, the entire gene coding for GnT-IV enzyme was amplified by PCR with the primers.
The amplified fragment obtained was digested with XhoI and XbaI, and inserted between the XhoI and XbaI sites of
pSVL vector (Pharmacia, Sweden) to prepare plasmid pBGT4.
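
The primer strategy described above (adding a restriction site to each end of an open reading frame by PCR) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the short template is invented and is not SEQ ID NO: 17, the primers are not SEQ ID NOS: 19 and 20, and the annealing length and "GC" clamp are arbitrary choices. Only the XhoI (CTCGAG) and XbaI (TCTAGA) recognition sequences are standard facts.

```python
# Illustrative sketch only: the template below is a made-up toy ORF,
# not the bovine GnT-IVa sequence, and the primers are hypothetical.
XHOI = "CTCGAG"  # XhoI recognition site
XBAI = "TCTAGA"  # XbaI recognition site

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    return seq.translate(_COMPLEMENT)[::-1]


def design_primers(template: str, anneal_len: int = 12, clamp: str = "GC"):
    """Build a forward primer carrying an XhoI site and a reverse primer
    carrying an XbaI site, each followed by template-specific sequence."""
    forward = clamp + XHOI + template[:anneal_len]
    reverse = clamp + XBAI + reverse_complement(template[-anneal_len:])
    return forward, reverse


def expected_amplicon(template: str, clamp: str = "GC") -> str:
    """Top strand of the PCR product: 5' tail + template + 3' tail."""
    return clamp + XHOI + template + reverse_complement(clamp + XBAI)


# Hypothetical 33-nt ORF (start codon ... stop codon) without internal sites.
toy_orf = "ATGAGGCTCCGCAATGGAACTGTAGCCACTTGA"
fwd, rev = design_primers(toy_orf)
product = expected_amplicon(toy_orf)
print(fwd)
print(rev)
print(product)
```

In this sketch each site occurs exactly once in the product, so digestion with both enzymes would release a single fragment with one XhoI end and one XbaI end, which is what directional insertion into a vector cut with the same two enzymes requires.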

(2) Introduction into COS7 Cells 

[0106] Plasmid pBGT4 was introduced into COS7 cells (RIKEN Cell Bank) by electroporation. Briefly, 10 ng of the
plasmid was added to approximately 5 × 10⁶ cells in 0.8 ml of PBS(-) (Nissui Pharmaceutical Co.). A voltage of 1600 V
was applied at a capacitance of 25 µF with a Gene Pulser (BioRad, USA) at room temperature to introduce the gene into
the cells. The resultant cells were transferred to a 90 mm laboratory dish and cultured in 10 ml of Dulbecco's modified
Eagle's medium (Base Catalogue No. 12430, Life Technologies, Inc., USA) containing 10% fetal bovine serum under
5% CO₂ at 37°C for 72 hours. Thereafter, the cells were recovered and suspended in 100 µl of a buffer (5 mM Tris-HCl,
pH 7.5, 2 mM MgCl₂, 1 mM phenylmethylsulfonyl fluoride), followed by sonication and centrifugation at 2000 × g for 5
minutes. Thus, a cell extract was obtained.

(3) Assay of GnT-IV Activity 

[0107] GnT-IV activity in the cell extract was determined by the method described in Reference Example 2. The
results are shown in Table 6. Compared to the extract from cells into which pSVL vector was introduced as a control,
the extracts from cells into which plasmid pBGT4 was introduced exhibited 44-78 times higher GnT-IV activity per cell.
From these results, it was confirmed that Gene 1 encodes GnT-IV enzyme. Thus, GnT-IV enzyme can be produced by
cultured cells according to this method.



Table 6

Plasmid        Specific Activity         Activity Ratio
               (pmol/hr/mg protein)
pSVL                   409                      1
pBGT4(#1)            29623                     72
pBGT4(#2)            31773                     78
pBGT4(#3)            20182                     44

Reaction time: 4 hours
Activity ratios are expressed in relation to the total activity of pSVL that is regarded as 1.



[EXAMPLE 5] Isolation and Identification of Human GnT-IVa cDNA 


(1) RT-PCR 

[0108] Based on the nucleotide sequence of bovine GnT-IVa obtained in Example 3, primer h1-2F shown in SEQ ID
NO: 21 and primer h1-1R shown in SEQ ID NO: 22 were synthesized. Using total RNA from human liver (Clontech,
USA) as a template, RT-PCR was performed with the above primers. As a result, an apparently specific amplified
fragment of about 650 bp was obtained. This fragment was subcloned, and the nucleotide sequence thereof was
determined.

(2) Screening of a Library 

[0109] A cDNA library from human liver (Clontech, USA) was screened using the 685 bp DNA fragment obtained by
the above RT-PCR as a probe. Two positive plaques, hGT4/λgt10-1 and hGT4/λgt10-2, were obtained. The nucleotide
sequences of the inserts in these phage clones were determined. As a result, hGT4/λgt10-1 contained an 804 bp
DNA region and hGT4/λgt10-2 contained a 2115 bp DNA region. The former region was entirely included in the latter
region. As shown in SEQ ID NO: 23, the DNA fragment contained in hGT4/λgt10-2 had an open reading frame (ORF)
encoding an amino acid sequence highly homologous to that of bovine GnT-IVa (96% identical). From the results
described in Example 6, it was confirmed that this ORF is the human GnT-IVa gene. The amino acid sequence encoded
by this ORF is shown in SEQ ID NO: 24.

[EXAMPLE 6] Construction of an Expression Plasmid for Human GnT-IVa Gene and a Method for Producing Human
GnT-IVa Enzyme

(1) Construction of Expression Plasmid pHGT4-1 for Human GnT-IVa Gene 

[0110] A primer (M-7F; SEQ ID NO: 25) which introduces an XhoI site into a region upstream of the initiation codon
of human GnT-IVa gene and another primer (h1-7R; SEQ ID NO: 26) which is complementary to a region downstream
of the termination codon of the gene were synthesized. Using RNA from human liver (Clontech, USA) as a template,
the entire gene coding for human GnT-IVa enzyme was amplified by RT-PCR with the above primers. The resultant
amplified fragment was inserted into the SrfI site of plasmid pCRScript Amp SK(+) (Stratagene, USA) in the opposite
direction to the transcription of the lacZ gene. Using the resultant plasmid, it was confirmed by nucleotide sequence
analysis that the amplified fragment encodes the amino acid sequence shown in SEQ ID NO: 24. Further, this plasmid was
digested with XhoI and SacI to obtain an XhoI-SacI 1.7 kb fragment. This fragment was inserted between the XhoI and
SacI sites of pSVL vector (Pharmacia, Sweden) to prepare an expression plasmid pHGT4-1 for human GnT-IVa gene.

(2) Introduction of Human GnT-IVa Gene into COS7 Cells

[0111] Plasmid pHGT4-1 was introduced into COS7 cells by electroporation. The resultant cells were cultured under
10% CO₂ at 37°C for 72 hours. Then, the cells were harvested, suspended in 100 µl of a buffer (5 mM Tris-HCl, pH 7.5,
2 mM magnesium chloride, 1 mM phenylmethylsulfonyl fluoride), disrupted by sonication and centrifuged at 2000 × g
for 5 minutes. The supernatant was collected to obtain a cell extract.

(3) Expression of Human GnT-IVa Gene in COS7 Cells 

[0112] GnT-IV activity in the cell extract was determined by the method described in Reference Example 2. The
results are shown in Table 7. Compared to the extract from cells into which pSVL vector was introduced as a control,
the extracts from cells into which plasmid pHGT4-1 was introduced exhibited 21-28 times higher GnT-IV activity per cell.
From these results, it was confirmed that the GnT-IVa gene shown in SEQ ID NO: 23 encodes the glycosyltransferase
GnT-IV. It was also confirmed that human GnT-IVa enzyme can be produced by cultured cells according to this method.



Table 7

Plasmid          Specific Activity         Activity Ratio
                 (pmol/hr/mg protein)
pSVL                    1037                      1
pHGT4-1(#1)            28951                     28
pHGT4-1(#2)            21788                     21
pHGT4-2(#1)            11024                     11
pHGT4-2(#2)             8029                      8

Reaction time: 1.3 hours
Activity ratios are expressed in relation to the total activity of pSVL that is regarded as 1.
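
As a small worked check, the activity ratios in Table 7 can be recomputed from the specific activities by dividing each value by that of the pSVL control. The sketch below assumes this is how the ratios were derived (the table footnote suggests so); the function and variable names are illustrative only.

```python
# Specific activities (pmol/hr/mg protein) as listed in Table 7.
specific_activity = {
    "pSVL": 1037,
    "pHGT4-1(#1)": 28951,
    "pHGT4-1(#2)": 21788,
    "pHGT4-2(#1)": 11024,
    "pHGT4-2(#2)": 8029,
}


def activity_ratios(activities: dict, control: str = "pSVL") -> dict:
    """Express each specific activity as a multiple of the control value."""
    base = activities[control]
    return {name: value / base for name, value in activities.items()}


ratios = activity_ratios(specific_activity)
for name, ratio in ratios.items():
    # Rounded to the nearest integer, as in the table.
    print(f"{name:<14}{round(ratio):>4}")
```

Rounded to whole numbers, the pHGT4-1 extracts fall in the 21-28 range and the pHGT4-2 extracts in the 8-11 range quoted in the text.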



[EXAMPLE 7] Isolation and Identification of Human GnT-IVb cDNA 


(1) Acquisition of Human GnT-IVa-like Gene by PCR, RT-PCR and 5'RACE (Rapid Amplification of cDNA Ends) 

[0113] Nucleotide sequences having similarity to the nucleotide sequence of human GnT-IVa gene obtained in Example
3 were searched for in the DNA database GenBank by BLASTN. As a result, Accession Numbers R12057, H10557
and W16571 were found. Then, primer h2-45F shown in SEQ ID NO: 27 and primer h2-43R shown in SEQ ID NO:
28 were synthesized to perform a PCR using a cDNA library from human brain of Quick Screen Human cDNA Library
Panel (Clontech, USA) as a template. The amplified fragment was subcloned into the SrfI site of pCRScript Amp SK(+)
(Stratagene, USA) and subjected to analysis of the nucleotide sequence. Also, primer h2-2F shown in SEQ ID NO: 29
and primer h2-1R shown in SEQ ID NO: 30 were synthesized to perform an RT-PCR using total RNA from human lung
(Clontech, USA) as a template. As a result, an amplified fragment of the expected size of about 500 bp was obtained.
Then, this fragment was subcloned into the SrfI site of pCRScript Amp SK(+) (Stratagene, USA) and subjected to
analysis of the nucleotide sequence.

[0114] The nucleotide sequences of the two DNA fragments thus obtained overlapped with each other, forming
a region of 1006 bp. In this region, one reading frame was recognized which encodes an amino acid sequence homologous
to those of bovine and human GnT-IVa. Thus, the existence of a protein related to GnT-IVa proteins was
suggested.

[0115] Then, nucleotide sequences which might be an upstream sequence to R12057 or a downstream
sequence to W16571 were searched for in the DNA database GenBank by BLASTN. As a result, R15554 was found as
an upstream sequence to R12057, and W16466 as a downstream sequence to W16571. However, an apparently
inappropriate termination codon was contained in the ORFs deduced from these nucleotide sequences. Therefore, in order
to confirm the nucleotide sequences, DNA fragments were obtained by RT-PCR. As primers, h2-1F shown in SEQ ID
NO: 31, h2-3F shown in SEQ ID NO: 32 and h2-8R shown in SEQ ID NO: 33 were synthesized. With a combination of
h2-1F and the h1-1R described in Example 5, or a combination of h2-3F and h2-8R, an RT-PCR was performed using
total RNA from human liver (Clontech, USA) as a template. Amplified fragments of about 550 bp and about 300 bp, both
coinciding with the expected sizes, were detected. Each of these fragments was subcloned into the SrfI site of pCRScript
Amp SK(+) to analyze the nucleotide sequence thereof. As a result, it was confirmed that these fragments overlap,
respectively, with an upstream region and a downstream region of the 1006 bp region between h2-45F and h2-1R
mentioned above. In the ligated region of 1361 bp, an ORF was found which encodes a protein of 433 amino acids having
high similarity to the amino acid sequences of bovine and human GnT-IVa proteins.

[0116] However, when this ORF was compared to the amino acid sequences of GnT-IVa proteins, it was presumed that
the starting methionine should be present in a region upstream of this ORF. Therefore, the upstream region was
obtained by 5'-RACE using Human Lung 5'-RACE-Ready cDNA (Clontech, USA). In the first PCR, an anchor primer
and h2-5R shown in SEQ ID NO: 34 were used as primers. In the second PCR, an anchor primer and h2-3R shown in
SEQ ID NO: 35 were used as primers. The fragments obtained by 5'-RACE were purified, digested with EcoRI and PstI,
and then separated by agarose gel electrophoresis. A fragment of about 450 bp was recovered from the gel. This fragment
was inserted between the EcoRI and PstI sites of pUC18 vector (Pharmacia, Sweden) to analyze the nucleotide
sequence thereof. As a result, it was confirmed that this fragment overlaps with a region upstream of the region
between h2-1F and h2-8R. In the ligated region of 1758 bp, one ORF was confirmed which encodes a protein of 548 amino
acids having high similarity to the amino acid sequences of bovine and human GnT-IVa proteins. The nucleotide
sequence of this ORF is shown in SEQ ID NO: 36, and the amino acid sequence thereof in SEQ ID NO: 37. From the
results described in Example 8 below, it was confirmed that this gene is the human GnT-IVb gene.
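
A simple arithmetic sanity check on the assembled regions: an ORF of N amino acids needs 3N nucleotides, plus 3 for a termination codon, so it must fit within the ligated region. The sketch below applies this to the two region/ORF sizes quoted above. It is only a consistency check under that assumption and says nothing about where the ORF sits within each region.

```python
def min_orf_nucleotides(n_amino_acids: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode n_amino_acids, optionally with a stop codon."""
    codons = n_amino_acids + (1 if include_stop else 0)
    return 3 * codons


def flanking_nucleotides(region_bp: int, n_amino_acids: int) -> int:
    """Nucleotides of the region left over outside the ORF (5' plus 3' combined)."""
    return region_bp - min_orf_nucleotides(n_amino_acids)


# (region length in bp, ORF length in amino acids) as stated in the text.
regions = {
    "ligated region before 5'-RACE": (1361, 433),
    "ligated region after 5'-RACE": (1758, 548),
}

for label, (bp, aa) in regions.items():
    print(f"{label}: ORF needs {min_orf_nucleotides(aa)} nt, "
          f"{flanking_nucleotides(bp, aa)} nt remain outside the ORF")
```

Both ORFs fit within their regions, and the 115 additional residues gained after 5'-RACE are consistent with the starting methionine lying upstream of the first assembly.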

[EXAMPLE 8] Construction of an Expression Plasmid for Human GnT-IVb Gene and a Method for Producing Human 
GnT-IVb Enzyme 

(1) Construction of Expression Plasmid pHGT4-2 for Human GnT-IVb Gene 

[0117] A primer (SEQ ID NO: 38) which introduces an XhoI site into a region upstream of the initiation codon of
human GnT-IVb gene, and another primer (h2-10R: SEQ ID NO: 39) which introduces an XbaI site into a region
downstream of the termination codon of the above gene were synthesized. Using these primers, the entire ORF coding for
human GnT-IVb enzyme was amplified by RT-PCR with RNA from human lung (Clontech, USA) as a template. The
amplified fragment was inserted into the SrfI site of plasmid pCRScript Amp SK(+), followed by determination of the
nucleotide sequence thereof. As a result, it was confirmed that the amplified fragment codes for the amino acid
sequence of SEQ ID NO: 37. Further, this plasmid was digested with XhoI and XbaI to obtain an XhoI-XbaI 1.7 kb
fragment. This fragment was inserted between the XhoI and XbaI sites of pSVL vector (Pharmacia, Sweden) to construct
an expression plasmid pHGT4-2 for human GnT-IVb gene.

(2) Introduction of Human GnT-IVb Gene into COS7 Cells 

[0118] Plasmid pHGT4-2 was introduced into COS7 cells by electroporation. The resultant cells were cultured under
10% CO₂ at 37°C for 72 hours. Then, the cells were recovered, suspended in 100 µl of a buffer (5 mM Tris-HCl, pH 7.5,
2 mM magnesium chloride, 1 mM phenylmethylsulfonyl fluoride), disrupted by sonication and centrifuged at 2000 × g
for 5 minutes. The supernatant was collected to thereby obtain a cell extract.

(3) Expression of Human GnT-IVb Gene in COS7 Cells 

[0119] GnT-IV activity in the cell extract was determined by the method described in Reference Example 2. The
results are shown in Table 7 above. Compared to the extract from cells into which pSVL vector was introduced as a
control, the extracts from cells into which plasmid pHGT4-2 was introduced exhibited 8-11 times higher GnT-IV activity per
cell. From these results, it was confirmed that the GnT-IVb gene shown in SEQ ID NO: 36 encodes the
glycosyltransferase GnT-IV. It was also confirmed that human GnT-IVb enzyme can be produced by cultured cells according to this
method.

[EXAMPLE 9] Construction of Expression Plasmids for Bovine GnT-IVa N-Terminal Deletion Mutants and their
Expression

(1) Construction of Expression Plasmids pSigIle93, pSigPro113 and pSigPro142 for Bovine GnT-IVa

[0120] A primer (XhoEsig: SEQ ID NO: 40) which introduces an XhoI site into a region upstream of the signal
sequence of human erythropoietin (GenBank Accession Number X02157) and an antisense primer (E4-1R: SEQ ID
NO: 41) which ligates the C-terminus of the above signal sequence to a part of the bovine GnT-IVa amino acid sequence
spanning from position 93 (Ile) to the end were synthesized to amplify the signal sequence of human erythropoietin by
PCR. Also, a sense primer (E4-1F: SEQ ID NO: 42) corresponding to the above antisense primer and a primer (4EXPR:
SEQ ID NO: 20) which introduces an XbaI site into a region downstream of the termination codon of bovine GnT-IVa gene
were synthesized to amplify a partial sequence of bovine GnT-IVa gene by PCR. Using portions of the resultant two
PCR products as a mixed template, a PCR was performed with primers XhoEsig and 4EXPR. The amplified fragment
was digested with XhoI and XbaI and inserted between the XhoI and XbaI sites of pSVL vector (Pharmacia, Sweden),
to thereby construct plasmid pSigIle93 that expresses an amino acid sequence in which the human erythropoietin
signal is linked to a part of the bovine GnT-IVa amino acid sequence spanning from position 93 to the end.
[0121] Plasmid pSigPro113 that expresses an amino acid sequence in which the human erythropoietin signal is linked
to a part of the bovine GnT-IVa amino acid sequence spanning from position 113 (Pro) to the end, and plasmid
pSigPro142 that expresses an amino acid sequence in which the human erythropoietin signal is linked to a part of the
bovine GnT-IVa amino acid sequence spanning from position 142 (Pro) to the end were each constructed in the
same manner as described above, using E4-2R primer (SEQ ID NO: 43) or E4-3R primer (SEQ ID NO: 44), respectively,
instead of E4-1R primer, and E4-2F primer (SEQ ID NO: 45) or E4-3F primer (SEQ ID NO: 46), respectively, instead of
E4-1F primer.


(2) Introduction of plasmids expressing Bovine GnT-IVa N-Terminal Deletion Mutants into COS7 Cells 

[0122] Plasmid pSigIle93, pSigPro113 or pSigPro142 was introduced into COS7 cells by electroporation. The resultant
cells were cultured under 10% CO₂ at 37°C for 72 hours. Then, the cells and the culture supernatant were recovered
separately. The cells were suspended in 100 µl of a buffer (5 mM Tris-HCl, pH 7.5, 2 mM magnesium chloride, 1
mM phenylmethylsulfonyl fluoride), disrupted by sonication and centrifuged at 2000 × g for 5 minutes to thereby obtain a
cell extract. The culture supernatant was concentrated to about 100 µl with Centriplus 30 (Amicon).

(3) Expression of Bovine GnT-IVa N-Terminal Deletion Mutants in COS7 Cells 


[0123] GnT-IV activity in the culture supernatant and the cell extract was determined by the method described in
Reference Example 2. The results are shown in Table 8. Relative to the total activity (i.e., activity in cells + activity in
supernatant) of the cells into which pBGT4 vector was introduced as a positive control, the total activity of the cells into
which pSigIle93 was introduced was more than 30%. Furthermore, more than one third of that activity was secreted into
the culture supernatant. From these results, it was found that the amino acids from the N-terminus to position 92 of the
bovine GnT-IVa amino acid sequence can be deleted while retaining the enzyme activity. It was also shown that GnT-IVa
enzyme can be expressed in secreted form by using an appropriate secretion signal.



Table 8

Plasmid       Fraction       Activity     Activity Ratio in     Total Activity
                             (pmol/hr)    Each Fraction (%)     Ratio (%)
pSVL          Supernatant        136             0.5                  1.9
pSVL          Cells              384             1.4
pBGT4         Supernatant        722             2.7                100.0
pBGT4         Cells            26152            97.3
pSigIle93     Supernatant       3106            11.6                 31.9
pSigIle93     Cells             5471            20.4
pSigPro113    Supernatant        312             1.2                  3.4
pSigPro113    Cells              606             2.3
pSigPro142    Supernatant        219             0.8                  2.2
pSigPro142    Cells              381             1.4

Reaction time: 2.5 hours
The activity ratios are expressed in percent in relation to the total activity of pBGT4 that is regarded as 100%.
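
The percentages in Table 8 can be recomputed from the raw activities. The sketch below assumes, following the table footnote, that every value is expressed relative to the summed (supernatant + cells) activity of pBGT4; the data structure and names are illustrative only.

```python
# Activities (pmol/hr) as listed in Table 8.
activity = {
    "pSVL":       {"supernatant": 136,  "cells": 384},
    "pBGT4":      {"supernatant": 722,  "cells": 26152},
    "pSigIle93":  {"supernatant": 3106, "cells": 5471},
    "pSigPro113": {"supernatant": 312,  "cells": 606},
    "pSigPro142": {"supernatant": 219,  "cells": 381},
}


def percent_of_reference(data: dict, reference: str = "pBGT4") -> dict:
    """Express each fraction, and each plasmid total, as % of the reference total."""
    ref_total = sum(data[reference].values())
    table = {}
    for plasmid, fractions in data.items():
        per_fraction = {name: 100 * value / ref_total
                        for name, value in fractions.items()}
        table[plasmid] = {"fractions": per_fraction,
                          "total": sum(per_fraction.values())}
    return table


ratios = percent_of_reference(activity)

# Share of the pSigIle93 activity that was found in the culture supernatant.
ile93 = activity["pSigIle93"]
secreted_share = ile93["supernatant"] / sum(ile93.values())

for plasmid, row in ratios.items():
    print(f"{plasmid:<11} total {row['total']:5.1f}%")
print(f"pSigIle93 secreted share: {secreted_share:.1%}")
```

With these inputs the pSigIle93 total comes out just above 30% of the pBGT4 total, and a little over one third of it is in the supernatant, matching the statements in the text.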



[EXAMPLE 10] Construction of Expression Plasmids for Bovine GnT-IVa C-Terminal Deletion Mutants and their
Expression


(1) Construction of Expression Plasmids pCGly499, pCPro465, pCLys432 and pCPro383 for Bovine GnT-IVa

[0124] A primer (SEQ ID NO: 19) which introduces an XhoI site into a region upstream of the initiation codon of bovine
GnT-IVa gene and a primer (CGly499: SEQ ID NO: 47) which places a termination codon after the Gly codon at position
499 and introduces an XbaI site into a region downstream of that termination codon were synthesized to amplify
a partial sequence of bovine GnT-IVa gene by PCR. The amplified fragment was digested with XhoI and XbaI, and
inserted between the XhoI and XbaI sites of pSVL vector (Pharmacia, Sweden). Thus, plasmid pCGly499, which
expresses the bovine GnT-IVa amino acid sequence up to position 499 (glycine), was constructed. Using CPro465
primer (SEQ ID NO: 48), CLys432 primer (SEQ ID NO: 49) or CPro383 primer (SEQ ID NO: 50) instead of CGly499
primer, three additional plasmids were constructed in the same manner. They were designated pCPro465 (plasmid
which expresses the bovine GnT-IVa amino acid sequence up to position 465 (proline)); pCLys432 (plasmid which
expresses the bovine GnT-IVa amino acid sequence up to position 432 (lysine)); and pCPro383 (plasmid which
expresses the bovine GnT-IVa amino acid sequence up to position 383 (proline)), respectively.

(2) Introduction of plasmids expressing Bovine GnT-IVa C-Terminal Deletion Mutants into COS7 Cells 

[0125] Plasmid pCGly499, pCPro465, pCLys432 or pCPro383 was introduced into COS7 cells by electroporation.
The resultant cells were cultured under 10% CO₂ at 37°C for 72 hours. Then, the cells were recovered and suspended
in 100 µl of a buffer (5 mM Tris-HCl, pH 7.5, 2 mM magnesium chloride, 1 mM phenylmethylsulfonyl fluoride), disrupted
by sonication and centrifuged at 2000 × g for 5 minutes to thereby obtain a cell extract.

(3) Expression of Bovine GnT-IVa C-Terminal Deletion Mutants in COS7 Cells 


[0126] GnT-IV activity in the cell extract was determined by the method described in Reference Example 2. The
results are shown in Table 9. Relative to the GnT-IV activity of the extract from cells into which pBGT4 vector was
introduced as a positive control, the activity of the extract from cells into which pCGly499, pCPro465, pCLys432 or
pCPro383 was introduced was 15.2%, 12.1%, 2.8% or 104.2% per cell, respectively. From these results, it was shown
that GnT-IV activity can be retained even if the amino acids from position 384 to the C-terminus in the bovine GnT-IVa
amino acid sequence are deleted.



Table 9

Plasmid        Specific Activity         Activity Ratio (%)
               (pmol/hr/mg protein)
pSVL                    77                       1
pBGT4                14917                     100
pCGly499              2263                      15
pCPro465              1798                      12
pCLys432               410                       3
pCPro383             15551                     104

Reaction time: 2 hours
Activity ratios are expressed in percent in relation to the total activity of pBGT4 which is regarded as 100%.



[EXAMPLE 11] Construction of Plasmids to Express Various GnT-IV Genes in E. coli and their Expression

(1) Construction of E. coli Expression Plasmid for Bovine GnT-IVa

[0127] A primer (BSP-N: SEQ ID NO: 51) which introduces a BspHI site into a region upstream of the initiation codon
of bovine GnT-IVa gene and another primer (C-Hd: SEQ ID NO: 52) which introduces a HindIII site into a region
downstream of the termination codon were synthesized to amplify the entire open reading frame of bovine GnT-IVa gene by
PCR. The amplified fragment was digested with BspHI and HindIII, and introduced between the NcoI and HindIII sites
of pTrc99A vector (Pharmacia, Sweden) to thereby construct plasmid pEBGT4. Using BSP-sN primer (SEQ ID NO: 53)
instead of BSP-N primer together with C-Hd primer, plasmid pEIle93 was constructed in a similar manner. Further, a
gene coding for the open reading frame in which His-Tag is added to the C-terminus was amplified using BSP-N primer,
a primer (CH-Hd: SEQ ID NO: 54) which can introduce His-Tag, a termination codon and a HindIII site into a region
downstream of the C-terminus of bovine GnT-IVa gene, and a primer (H-Hd: SEQ ID NO: 55) which has His-Tag, a termination
codon and a HindIII site, to thereby construct plasmid pEBGT4+His in a similar manner.
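
The cloning step above inserts a BspHI-digested fragment into an NcoI-digested vector. This works because the two enzymes leave the same 5' overhang. The following minimal sketch checks that compatibility from the recognition sequences and cut positions, which are standard properties of these enzymes; the helper functions are illustrative and assume palindromic sites cut symmetrically to give 5' overhangs.

```python
# Recognition site (5'->3') and top-strand cut position counted from the 5' end.
# Only enzymes that leave 5' overhangs are listed here.
ENZYMES = {
    "BspHI":   ("TCATGA", 1),  # T^CATGA
    "NcoI":    ("CCATGG", 1),  # C^CATGG
    "HindIII": ("AAGCTT", 1),  # A^AGCTT
    "XhoI":    ("CTCGAG", 1),  # C^TCGAG
    "XbaI":    ("TCTAGA", 1),  # T^CTAGA
}


def five_prime_overhang(enzyme: str) -> str:
    """Single-stranded 5' overhang left by a symmetric cut in a palindromic site."""
    site, cut = ENZYMES[enzyme]
    return site[cut:len(site) - cut]


def compatible(enzyme_a: str, enzyme_b: str) -> bool:
    """True if the two enzymes leave identical (hence ligatable) overhangs."""
    return five_prime_overhang(enzyme_a) == five_prime_overhang(enzyme_b)


for name in ENZYMES:
    print(f"{name:<8}{five_prime_overhang(name)}")
print("BspHI/NcoI compatible:", compatible("BspHI", "NcoI"))
```

Both sites contain ATG, so a primer can place the BspHI site directly over the initiation codon, and ligation into the NcoI-cut vector keeps that codon in place.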

(2) Construction of E. coli Expression Plasmid for Human GnT-IVa Gene and Human GnT-IVb Gene 

[0128] A primer (4aBSPIL94: SEQ ID NO: 56) which introduces an initiation codon and an Ile codon into a region
upstream of position 94 (Leu) of the human GnT-IVa amino acid sequence and which can also introduce a BspHI site
into a region further upstream thereof; a primer (4aCH-Hd: SEQ ID NO: 57) which introduces His-Tag, a termination
codon and a HindIII site into a region downstream of the C-terminal amino acid; and H-Hd primer were synthesized to
amplify a gene fragment composed of a sequence encoding a partial sequence of the human GnT-IVa amino acid
sequence, to which a sequence encoding His-Tag is added. The amplified fragment was digested with BspHI and HindIII
and inserted between the NcoI and HindIII sites of pTrc99A vector (Pharmacia, Sweden) to thereby construct plasmid
pMA4a+His. Further, using CP383H-Hd primer (SEQ ID NO: 58) instead of 4aCH-Hd primer, plasmid pCore+His was
constructed in a similar manner (Fig. 18). A fragment of human GnT-IVb gene was amplified using a primer (4bBSP-N:
SEQ ID NO: 59) which introduces a BspHI site into a region upstream of the initiation codon of human GnT-IVb gene and
4bSACR primer (SEQ ID NO: 60), digested with BspHI and SacI, and then inserted between the NcoI and SacI sites of
pTrc99A vector (Pharmacia, Sweden). Between the SacI and HindIII sites of the resultant plasmid was inserted a partial
length of human GnT-IVb gene that had been amplified using 4bSACF primer (SEQ ID NO: 61), a primer (4bCH-Hd: SEQ
ID NO: 62) which introduces His-Tag at the C-terminus of the human GnT-IVb amino acid sequence, and H-Hd primer,
and then digested with SacI and HindIII, to thereby obtain plasmid pEHGT4-2+His. Further, a partial sequence of human
GnT-IVb gene was amplified using a primer (4bNCOG91: SEQ ID NO: 63) which introduces an NcoI site and an initiation
codon into a region upstream of position 91 (Gly) of the human GnT-IVb amino acid sequence, 4bCH-Hd primer and H-Hd
primer, digested with NcoI and HindIII, and then inserted between the NcoI and HindIII sites of pTrc99A vector
(Pharmacia, Sweden) to thereby construct plasmid pMA4b+His.

(3) Introduction of Each Expression Plasmid into E. coli BL21 Strain 

[0129] Each expression plasmid was introduced into competent cells of E. coli BL21 strain prepared by the calcium
method. The resultant cells were cultured on an LB agar plate containing 100 µg/ml ampicillin. The resultant colonies of
E. coli transformed with each plasmid were inoculated into LB liquid medium and cultured under shaking at 37°C
overnight. Then, the culture was inoculated into fresh LB liquid medium to give a concentration of 2%. When the turbidity
(OD at 595 nm) of the culture fluid was about 0.1 to 0.2, IPTG (isopropyl β-D-thiogalactopyranoside) was added thereto
to give a final concentration of 1 mM. The cells were cultured at 37°C for 2 hours or at 25°C for 4 hours. Then, 500 µl
of the cell culture was harvested. The cell pellet was suspended in 50 µl of a buffer (5 mM Tris-HCl, pH 7.5, 2 mM MgCl₂, 1
mM phenylmethylsulfonyl fluoride), disrupted by sonication and centrifuged at 2000 × g for 5 minutes to obtain a cell
extract as a supernatant.

(4) Expression of Each Expression Plasmid in E. coli BL21 Strain 

[0130] GnT-IV activity in the cell extract was determined by the method described in Reference Example 2. Table 10
shows the results of the expression of the bovine gene. Although the extract from E. coli cells into which pTrc99A vector
was introduced as a control had little GnT-IV activity, the extract from E. coli cells into which pEBGT4 was introduced had
definite GnT-IV activity. From these results, it was demonstrated that GnT-IV enzyme can be produced by E. coli. The
His-Tag sequence added to the C-terminus of bovine GnT-IVa did not greatly influence GnT-IV activity. Thus, it was
shown that an appropriate tag sequence can be added to GnT-IV enzyme. The mutant (pEIle93) in which the N-terminal
92 amino acids are deleted exhibited stronger GnT-IV activity. Thus, the expression of variants of GnT-IV enzyme, which
was confirmed in animal cells, was also shown to be possible in E. coli.



Table 10

Plasmid        Activity                  Activity Ratio (%)
               (pmol/hr/mg protein)
pTrc99A                  0                       0
pEBGT4                4611                     100
pEBGT4+His            3090                      67
pEIle93               5841                     127

Reaction time: 3.0 hours
After IPTG addition, cells were cultured at 37°C for 2 hours. Activity ratios are expressed in percent in relation
to the total activity of pEBGT4 which is regarded as 100%.







[0131] Table 11 shows the results of the expression of the human gene. Compared to the extract from E. coli cells
into which pTrc99A vector was introduced as a control, the extract from E. coli cells into which any of the expression
plasmids was introduced had significant GnT-IV activity. As shown for bovine GnT-IVa enzyme, it was also possible in
human GnT-IVa and GnT-IVb enzymes to delete an N-terminal sequence while retaining the activity. Further, the human
GnT-IVa enzyme which has both N-terminal deletion and C-terminal deletion exhibited high GnT-IV activity (pCore+His).
This shows that those portions deleted in this mutant are not essential for GnT-IV activity.



Table 11

Plasmid          Activity                  Activity Ratio (%)
                 (pmol/hr/mg protein)
pTrc99A                    0                       0
pEBGT4+His             21390                     637
pMA4a+His               3359                     100
pCore+His              39766                    1184
pEHGT4-2+His             270                       8
pMA4b+His               2812                      84

Reaction time: 4.0 hours
After IPTG addition, cells were cultured at 25°C for 4 hours.
Activity ratios are expressed in percent in relation to the total activity of pMA4a+His which is regarded as 100%.



[EXAMPLE 12] Conversion of the Sugar Chain Branching Structure of a Recombinant Erythropoietin (EPO) by
Introducing Bovine or Human GnT-IVa Gene into EPO-Producing CHO Cells

(1) Introduction of GnT-IV Expression Plasmid into EPO-Producing CHO Cells 

[0132] EPO-producing CHO cell clones were created according to the method disclosed in Japanese Examined Patent
Publication No. 2-17156. GnT-IVa expression plasmid pBGT4 or pHGT4-1 was introduced into the resultant cell
clones MOI and H-5 by electroporation. In the introduction, 15 µg of the expression plasmid and 1.5 µg of a drug
resistance marker plasmid (pSV2bsr from Kaken Pharmaceutical or pMAMneo from Clontech) were used as a mixture. The
electroporated cells were cultured under 10% CO₂ at 37°C for about 60 hours. Then, blasticidin S (Kaken Pharmaceutical)
(final concentration: 10 µg/ml) or geneticin (Life Technologies, Inc.) (final concentration: 500 ng/ml) was added
to the medium, in which the cells were cultured for another 10 days to 2 weeks. Thus, clones resistant to either of the two
drugs were isolated.

(2) Confirmation of the Expression of the Introduced GnT-IV Genes in EPO-Producing CHO Cell Clones 

[0133] EPO-producing CHO cell clones (initial clones) and individual drug resistant clones were cultured on an
appropriate scale. Total RNA from each clone was purified. Then, RNA dot blot analysis was performed using a part of the
GnT-IVa gene as a probe to examine the amount of GnT-IVa mRNA. Further, GnT-IV activity expressed in the initial
clones and the drug resistant clones was determined by the assay described in Reference Example 2. Those clones
which gave a strong signal in RNA dot blot analysis and also exhibited higher GnT-IV activity than the initial clones were
selected and used for EPO production. The selected clones had increased GnT-IV activity; for example, M01(bovine
GnT-IV)#36 exhibited about a 104-fold increase over the M01 clone, and H-5(human GnT-IV)#23 exhibited about a 125-fold
increase over the H-5 clone.

(3) Production of EPO using GnT-IV Gene-Introduced EPO-Producing CHO Cell Clones

[0134] EPO is secreted into the culture fluid. Accordingly, EPO-producing CHO cell clones M01 and H-5, and the
above-mentioned clones M01(bovine GnT-IV)#36 and H-5(human GnT-IV)#23 were cultured in roller bottles. First,
each clone was adhesion-cultured in a growth medium, and then 1.5 x 10^7 cells were transferred to an 850 cm^2 roller
bottle containing 200 ml of a growth medium. The cells were cultured under 10% CO2 at 37°C for 3 days so that they
adhered to the bottle uniformly. Thereafter, the growth medium was removed, and the cells were washed with PBS
buffer. Then, 200 ml of a serum-free medium was added to the bottle, in which the cells were cultured under 10% CO2
at 37°C for 7 days. Thereafter, the culture supernatant was recovered. As the growth medium, D-MEM/F12 mixed
medium supplemented with 5% fetal bovine serum, 290 mg/liter L-glutamic acid, 1x MEM non-essential amino acid
solution and 100 nM methotrexate was used. As the serum-free medium, the above medium without fetal bovine serum
was used. EPO contained in each of the serum-free culture supernatants was quantitatively determined by ELISA using
anti-human EPO antibody.

(4) Analysis of EPOs Produced by GnT-IV Gene-Introduced or -Non-Introduced Clones Based on Their Sugar Chain
Structures

[0135] A recombinant EPO does not exist as a single molecule on isoelectric focusing gel; it is a mixture of molecules
with various electric charges. Since the protein moiety does not vary, it has been shown that the difference in electric
charge among these molecules is based on the difference in sugar chain structure; such a mixture of molecules is called
glycoforms [Watson, E. and Yao, F., Anal. Biochem. (1993), 210, 389-93]. EPO has three Asn-linked sugar chains; the
branching structures of individual sugar chains vary from biantennary to tetraantennary. Gal (galactose) is further
attached to the end of each of the branched GlcNAc's, and sialic acid is further attached to this Gal. Therefore, if the
degree of sugar chain branching is increased by the introduction of a GnT-IV gene, the number of sialic acid molecules
attached to Gal should increase and, thus, the content of glycoforms with low isoelectric points should increase. Accordingly,
the inventors performed analysis by isoelectric focusing to detect changes in the sugar chain structure of the EPOs
produced by GnT-IV gene-introduced EPO-producing cells.
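
The reasoning in paragraph [0135] can be made concrete with a small counting sketch. It assumes, as an idealization not stated as a measured result in the original, that every GlcNAc branch carries one Gal capped by exactly one sialic acid, and it ignores any glycans other than the three Asn-linked chains. The names below are hypothetical.

```python
# EPO has three Asn-linked sugar chains (stated in the text).
N_LINKED_SITES = 3

# Number of GlcNAc branches (antennae) per chain type.
ANTENNAE = {"biantennary": 2, "triantennary": 3, "tetraantennary": 4}


def max_sialic_acids(chains):
    """Upper bound on sialic acids carried by the Asn-linked chains of one EPO.

    Idealized assumption: each branch ends in Gal capped by one sialic acid.
    """
    if len(chains) != N_LINKED_SITES:
        raise ValueError("expected one chain type per Asn-linked site")
    return sum(ANTENNAE[chain] for chain in chains)


all_bi = max_sialic_acids(["biantennary"] * 3)
all_tri = max_sialic_acids(["triantennary"] * 3)
all_tetra = max_sialic_acids(["tetraantennary"] * 3)
print(all_bi, all_tri, all_tetra)
```

Under these assumptions, each additional branch adds at most one negative charge, which is why a higher degree of branching is expected to shift glycoforms toward lower isoelectric points.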

[0136] For the isoelectric focusing, Multiphor II equipment manufactured by Pharmacia was used. The gel was
composed of 5% acrylamide (30:0.8) and 1.5% Pharmalyte 2.5-5 (Pharmacia). As the (+) electrode solution, 0.1 M sulfuric
acid was used. As the (-) electrode solution, 0.2 M L-histidine was used. After the isoelectric focusing, samples were
electrophoretically transferred onto a PVDF membrane, followed by Western blot analysis using anti-EPO mouse
monoclonal antibody to detect the bands of individual glycoforms of the EPOs. Briefly, the serum-free culture supernatant
of each cell clone was concentrated about 7- to 1000-fold with Centriplus 30 and Microcon 30 (both manufactured by
Amicon), if necessary. At the beginning, about 50-100 IU of EPO was used as a sample, but this amount was adjusted
appropriately so that the intensities of bands detected by Western blot analysis would be almost equal between
samples.

[0137] When the EPO from the M01 clone was compared to the EPO from the M01(bovine GnT-IV)#36 clone, it was
confirmed that the positions of major glycoforms in the latter shift to the low pI side ((+) electrode solution side) by at
least one glycoform (Fig. 19). From this result, it is thought that the GnT-IV enzyme expressed as a result of the gene
introduction increased the number of GlcNAc branches in the Asn-linked sugar chains attached to EPO, thus
increasing the number of attached sialic acid molecules and the content of glycoforms with low isoelectric points. A
similar analysis was performed on the H-5 clone and the H-5(human GnT-IV)#23 clone. As a result, it was also found that the
positions of major EPO glycoforms in the latter shift to the low pI side (Fig. 19).

[0138] From the above, it was demonstrated that, by introducing a GnT-IV gene into any cell, it is possible to modify
the structure of the Asn-linked sugar chains of the protein produced by that cell.

Industrial Applicability 

[0139] According to the present invention, a novel β1→4 N-acetylglucosaminyltransferase (GnT-IV), a method for
producing the GnT-IV enzyme and a gene coding for the GnT-IV are provided. With the GnT-IV of the present invention,
it has become possible to produce a glycoconjugate having a branching structure which could not be formed with
conventional glycosyltransferases. Thus, the GnT-IV of the invention is useful not only for producing or improving
glycoconjugate type pharmaceuticals, reagents and foods, but also for modifying the sugar chain structure of any biopolymer.
[0140] The GnT-IV gene of the invention is also useful for diagnosing or treating diseases such as cancer and for
modifying the sugar chain structure of glycoconjugate products produced by microorganisms.

[0141] Further, an antibody or anti-serum raised against the GnT-IV protein of the invention as an antigen, or a part
or all of the GnT-IV gene of the invention as a probe, is useful for characterizing microorganisms, cultured cells, various
animal tissues, blood cells and blood, or for diagnosing diseased cells or tissues such as cancer.

SEQUENCE LISTING 
SEQ ID NO: 1 
SEQUENCE LENGTH: 8 
SEQUENCE TYPE: amino acid 
TOPOLOGY : linear 
MOLECULE TYPE: peptide 






SEQUENCE DESCRIPTION 
Asp Asn Leu Tyr Pro Glu Glu Lys 
5 

SEQ ID NO: 2 
SEQUENCE LENGTH: 11 
SEQUENCE TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 



SEQUENCE DESCRIPTION 

Asp Tyr Val Asn Gly Val Val Ala Asn Glu Lys 
5 10



SEQ ID NO: 3 
SEQUENCE LENGTH: 21 
SEQUENCE TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 

SEQUENCE DESCRIPTION

Glu Ile Ser Ser Gly Leu Val Glu Ile Ile Ser Pro Pro Glu Ser Tyr
5 10 15

Tyr Pro Asp Leu Thr
20

SEQ ID NO: 4 
SEQUENCE LENGTH: 8 
SEQUENCE TYPE: amino acid 
TOPOLOGY: linear
MOLECULE TYPE: peptide 

SEQUENCE DESCRIPTION 
Glu Arg Val Arg Trp Arg Thr Lys 
5 

SEQ ID NO: 5 
SEQUENCE LENGTH: 15 
SEQUENCE TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 

SEQUENCE DESCRIPTION 

Lys Gln Asn Leu Asp Tyr Cys Phe Leu Met Met Tyr Ala Gln Glu
5 10 15 

SEQ ID NO: 6 
SEQUENCE LENGTH: 6 
SEQUENCE TYPE: amino acid 
TOPOLOGY: linear
MOLECULE TYPE: peptide

SEQUENCE DESCRIPTION
Asp His Ile Leu Trp Val
5



SEQ ID NO: 7
SEQUENCE LENGTH: 14
SEQUENCE TYPE: amino acid
TOPOLOGY: linear
MOLECULE TYPE: peptide

SEQUENCE DESCRIPTION

Lys Ile His Val Asn Pro Pro Ala Glu Val Ser Thr Ser Leu



SEQ ID NO: 8
SEQUENCE LENGTH: 10
SEQUENCE TYPE: amino acid
TOPOLOGY: linear
MOLECULE TYPE: peptide

SEQUENCE DESCRIPTION

Lys Val Tyr Gln Gly His Thr Leu Glu Lys
5 10

SEQ ID NO: 9
SEQUENCE LENGTH: 10
SEQUENCE TYPE: amino acid
TOPOLOGY: linear
MOLECULE TYPE: peptide

SEQUENCE DESCRIPTION

Asp Phe Phe Trp Ala Ile Thr Pro Val Ala



SEQ ID NO: 10
SEQUENCE LENGTH: 6
SEQUENCE TYPE: amino acid
TOPOLOGY: linear
MOLECULE TYPE: peptide

SEQUENCE DESCRIPTION
Asp Tyr Ile Leu Phe Lys
5

SEQ ID NO: 11 
SEQUENCE LENGTH: 15 
SEQUENCE TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 



SEQUENCE DESCRIPTION 

Asp Lys Pro Val Asn Val Glu Ser Tyr Leu Phe His Ser Gly Asn 
5 10



SEQ ID NO: 12 
SEQUENCE LENGTH: 10 
SEQUENCE TYPE: amino acid 






EP 0 905 232 A1 



TOPOLOGY: linear 
MOLECULE TYPE: peptide 

SEQUENCE DESCRIPTION 

Asp Ile Leu Leu X Thr Thr Val Glu Val
5 10 

SEQ ID NO: 13 
SEQUENCE LENGTH: 9 
SEQUENCE TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 

SEQUENCE DESCRIPTION 
Lys Ser Glu Gly Leu Asp Ile Ser Lys
5 

SEQ ID NO: 14 
SEQUENCE LENGTH: 8 
SEQUENCE TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 

SEQUENCE DESCRIPTION 
Asp Gly Tyr Phe Arg Ile Gly Lys
5 

SEQ ID NO: 15 
SEQUENCE LENGTH: 29
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear
MOLECULE TYPE: synthetic DNA

SEQUENCE DESCRIPTION

AAR ATY CAY GTB AAY CCH CCH GCN GAR GT 29
Lys Ile His Val Asn Pro Pro Ala Glu Val
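
SEQ ID NO: 15 is written with degenerate base symbols, so it stands for a pool of distinct oligonucleotides. The sketch below counts the members of that pool, assuming the symbols follow the standard IUPAC nucleotide ambiguity codes (the listing itself does not define them); the helper names are hypothetical.

```python
from math import prod

# Standard IUPAC nucleotide ambiguity codes (assumed to apply to this listing).
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def degeneracy(primer):
    """Number of distinct sequences represented by a degenerate primer."""
    seq = primer.replace(" ", "").upper()
    return prod(len(IUPAC[base]) for base in seq)


SEQ15 = "AAR ATY CAY GTB AAY CCH CCH GCN GAR GT"
seq15_length = len(SEQ15.replace(" ", ""))
seq15_degeneracy = degeneracy(SEQ15)
print(seq15_length, seq15_degeneracy)
```

The length computed this way agrees with the SEQUENCE LENGTH of 29 given for SEQ ID NO: 15.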

SEQ ID NO: 16
SEQUENCE LENGTH: 35
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear
MOLECULE TYPE: synthetic DNA
ANTISENSE: Yes

SEQUENCE DESCRIPTION

TG RAA VAR RTA RSW YTC VAC RTT VAC DGG YTT RTC
His Phe Leu Tyr Ser Glu Val Asn Val Pro Lys Asp
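
Because SEQ ID NO: 16 is an antisense primer, the peptide printed beneath it reads from right to left. A minimal sketch of how the sense-strand codons can be recovered is given below; it assumes the standard IUPAC complement pairs for the degenerate symbols, and the names used are hypothetical.

```python
# Complement table including IUPAC ambiguity codes (assumed convention).
COMPLEMENT = str.maketrans("ACGTRYSWKMBDHVN", "TGCAYRSWMKVHDBN")


def reverse_complement(seq):
    """Reverse complement of a (possibly degenerate) DNA sequence."""
    return seq.replace(" ", "").upper().translate(COMPLEMENT)[::-1]


SEQ16 = "TG RAA VAR RTA RSW YTC VAC RTT VAC DGG YTT RTC"
sense = reverse_complement(SEQ16)

# Split the sense strand into codons; the last one is incomplete (2 bases).
codons = [sense[i:i + 3] for i in range(0, len(sense), 3)]
print(" ".join(codons))
```

Read in the sense direction, the codons correspond to Asp Lys Pro Val Asn Val Glu Ser Tyr Leu Phe His, i.e. the first twelve residues of the peptide of SEQ ID NO: 11, with the final His codon truncated to two bases.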

SEQ ID NO: 17 
SEQUENCE LENGTH: 2246 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: double 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA 

SEQUENCE DESCRIPTION

GGCGGCTGCT CCGTGGCGGC TCGTCGGCGG CCGCGGCAGG ACTGGCAGCG CCGGCGGCGG 60
GGAGAAAGAA GCATCCACCT ATGAAGACCG TGCAGACAGT CCTGAATAAT AATTGTGAAT 120
GGTGTGGCTG CCAGACTAGT TCTGCTGAGC ATCTGAAATG AACCTCTCCT ATTGATTGTT 180
TCAGTTGGCC CCGAGCCAGG AGTACTGGGT TTGCTTGACT TCAGGATAAA AAGAAACGGA 240
CTTGGTTATC ATCGTAAACA TATGAACCAG TGTGATGGTG AAATGAG ATG AGG CTC 296
Met Arg Leu
1

CGA AAT GGA ACT GTA GCC ACT GTT TTA GCA TTT ATC ACC TCG TTC CTC 344
Arg Asn Gly Thr Val Ala Thr Val Leu Ala Phe Ile Thr Ser Phe Leu
5 10 15

ACT TTA TCT TGG TAT ACA ACA TGG CAA AAT GGG AAA GAA AAA GTG ATT 392
Thr Leu Ser Trp Tyr Thr Thr Trp Gln Asn Gly Lys Glu Lys Val Ile
20 25 30 35

GCT TAT CAA CGA GAA TTT CTT GCT CTG AAA GAA CGT CTC CGA ATA GCT 440
Ala Tyr Gln Arg Glu Phe Leu Ala Leu Lys Glu Arg Leu Arg Ile Ala
40 45 50

GAA CAT CGA ATC TCT CAG CGC TCT TCT GAG CTC AGT GCC ATT GTA CAG 488
Glu His Arg Ile Ser Gln Arg Ser Ser Glu Leu Ser Ala Ile Val Gln
55 60 65

CAA TTC AAG CGT GTA GAA GCA GAA ACA AAC AGG AGT AAG GAT CCA GTG 536
Gln Phe Lys Arg Val Glu Ala Glu Thr Asn Arg Ser Lys Asp Pro Val
70 75 80

AAT AAA TTT TCA GAT GAT ACC CTA AAG ATA CTA AAG GAG TTA ACA AGC 584
Asn Lys Phe Ser Asp Asp Thr Leu Lys Ile Leu Lys Glu Leu Thr Ser
85 90 95

AAA AAG TCT CTT CAA GTG CCA AGT ATT TAT TAT CAT TTG CCT CAT TTA 632
Lys Lys Ser Leu Gln Val Pro Ser Ile Tyr Tyr His Leu Pro His Leu
100 105 110 115

TTG CAA AAT GAA GGA AGC CTT CAA CCT GCC GTG CAG ATC GGA AAT GGA 680
Leu Gln Asn Glu Gly Ser Leu Gln Pro Ala Val Gln Ile Gly Asn Gly
120 125 130

CGA ACA GGA GTT TCA ATA GTA ATG GGA ATT CCT ACA GTG AAG AGA GAA 728
Arg Thr Gly Val Ser Ile Val Met Gly Ile Pro Thr Val Lys Arg Glu
135 140 145

GTT AAA TCT TAC CTC ATA GAA ACT CTT CAT TCC CTT ATT GAT AAT CTG 776
Val Lys Ser Tyr Leu Ile Glu Thr Leu His Ser Leu Ile Asp Asn Leu
150 155 160

TAT CCT GAA GAG AAG TTG GAC TGT GTT ATA GTA GTC TTC ATA GGA GAG 824
Tyr Pro Glu Glu Lys Leu Asp Cys Val Ile Val Val Phe Ile Gly Glu
165 170 175

ACA GAT ACT GAT TAT GTA AAT GGT GTT GTA GCC AAC CTG GAG AAA GAA 872
Thr Asp Thr Asp Tyr Val Asn Gly Val Val Ala Asn Leu Glu Lys Glu
180 185 190 195

TTT TCT AAA GAA ATC AGT TCT GGC TTG GTG GAA ATA ATA TCA CCT CCT 920
Phe Ser Lys Glu Ile Ser Ser Gly Leu Val Glu Ile Ile Ser Pro Pro
200 205 210

GAA AGC TAT TAT CCT GAC CTG ACG AAC TTA AAG GAG ACA TTT GGA GAT 968
Glu Ser Tyr Tyr Pro Asp Leu Thr Asn Leu Lys Glu Thr Phe Gly Asp
215 220 225

TCT AAA GAA AGA GTA AGA TGG AGA ACA AAG CAA AAC CTA GAT TAT TGT 1016
Ser Lys Glu Arg Val Arg Trp Arg Thr Lys Gln Asn Leu Asp Tyr Cys
230 235 240

TTT CTA ATG ATG TAT GCT CAG GAA AAA GGC ACA TAC TAC ATC CAG CTT 1064
Phe Leu Met Met Tyr Ala Gln Glu Lys Gly Thr Tyr Tyr Ile Gln Leu
245 250 255

GAA GAT GAT ATT ATT GTC AAA CAG AAT TAC TTT AAC ACC ATA AAG AAT 1112
Glu Asp Asp Ile Ile Val Lys Gln Asn Tyr Phe Asn Thr Ile Lys Asn
260 265 270 275

TTT GCA CTT CAA CTT TCT TCT GAG GAA TGG ATG ATA CTT GAG TTC TCC 1160
Phe Ala Leu Gln Leu Ser Ser Glu Glu Trp Met Ile Leu Glu Phe Ser
280 285 290

CAG CTG GGA TTC ATT GGT AAA ATG TTT CAA GCA CCT GAC CCA CTC CTG 1208
Gln Leu Gly Phe Ile Gly Lys Met Phe Gln Ala Pro Asp Leu Thr Leu
295 300 305

ATT GTG GAA TTC ATA TTT ATG TTC TAT AAG GAG AAG CCC ATC GAC TGG 1256
Ile Val Glu Phe Ile Phe Met Phe Tyr Lys Glu Lys Pro Ile Asp Trp
310 315 320

CTC TTG GAC CAT ATT CTG TGG GTC AAA GTC TGC AAC CCG GAA AAA GAT 1304
Leu Leu Asp His Ile Leu Trp Val Lys Val Cys Asn Pro Glu Lys Asp
325 330 335

GCA AAA CAC TGT GAT CGA CAG AAG GCA AAT CTG CGA ATT CGT TTC AGA 1352
Ala Lys His Cys Asp Arg Gln Lys Ala Asn Leu Arg Ile Arg Phe Arg
340 345 350 355

CCG TCC CTT TTC CAA CAC GTT GGT CTG CAT TCT TCA CTC ACA GGA AAA 1400
Pro Ser Leu Phe Gln His Val Gly Leu His Ser Ser Leu Thr Gly Lys
360 365 370

ATT CAG AAA CTC ACG GAT AAA GAT TAC ATG AAA CCA TTA CTG CTC AAA 1448
Ile Gln Lys Leu Thr Asp Lys Asp Tyr Met Lys Pro Leu Leu Leu Lys
375 380 385

ATC CAT GTA AAC CCC CCT GCA GAG GTA TCT ACT TCT TTG AAG GTC TAC 1496
Ile His Val Asn Pro Pro Ala Glu Val Ser Thr Ser Leu Lys Val Tyr
390 395 400

CAA GGT CAT ACA CTG GAG AAA ACT TAC ATG GGT GAG GAC TTC TTC TGG 1544
Gln Gly His Thr Leu Glu Lys Thr Tyr Met Gly Glu Asp Phe Phe Trp
405 410 415

GCT ATA ACC CCA GTA GCT GGA GAC TAC ATC CTA TTT AAA TTC GAC AAG 1592
Ala Ile Thr Pro Val Ala Gly Asp Tyr Ile Leu Phe Lys Phe Asp Lys
420 425 430 435

CCA GTC AAT GTG GAA AGT TAT TTG TTC CAT AGT GGC AAC CAG GAT CAT 1640
Pro Val Asn Val Glu Ser Tyr Leu Phe His Ser Gly Asn Gln Asp His
440 445 450

CCA GGG GAT ATT CTG CTC AAC ACA ACG GTG GAA GTT CTG CCT TTG AAG 1688
Pro Gly Asp Ile Leu Leu Asn Thr Thr Val Glu Val Leu Pro Leu Lys
455 460 465

AGT GAA GGT TTG GAC ATC AGC AAA GAA ACC AAA GAC AAA CGA TTA GAA 1736
Ser Glu Gly Leu Asp Ile Ser Lys Glu Thr Lys Asp Lys Arg Leu Glu
470 475 480

GAT GGC TAT TTC AGA ATA GGG AAA TTT GAA AAC GGT GTT GCG GAA GGG 1784
Asp Gly Tyr Phe Arg Ile Gly Lys Phe Glu Asn Gly Val Ala Glu Gly
485 490 495

ATG GTG GAT CCC AGC CTA AAC CCC ATT TCG GCC TTC CGA CTT TCA GTT 1832
Met Val Asp Pro Ser Leu Asn Pro Ile Ser Ala Phe Arg Leu Ser Val
500 505 510 515

ATT CAG AAT TCT GCT GTT TGG GCC ATT CTT AAT GAG ATC CAT ATT AAA 1880
Ile Gln Asn Ser Ala Val Trp Ala Ile Leu Asn Glu Ile His Ile Lys
520 525 530

AAA GTC ACA AAC TGACCATC 1900
Lys Val Thr Asn
535

TCTACTAAGA AACCAACACA TTTTTTCCCT GTGAATTTGT TGATTAAAGA CAGCTGAGCA 1960
CGTACCTTTT TTTGGTAACT TGAATTCTAC CTCTCGCGAA ATCTACTGTA GATAAAATGA 2020
TTGTCATATT TCCACTTGGA AAATGAATCT CCCACGGATA ATTGTATTCA TTTGAATCTA 2080
AGCTGTCCTC CAGTTTTAAC TCAACTCAAA CGTTTTACAG TTATGACAGC CTGTTAATAT 2140
GACTTGTACT ATTTTGGTAT TATACTAATA CATAAGAGTT GTACATATTG TTACATTCAT 2200
TAAATTTGAG AAAAATTAAT GTTAAATACA TTTTATGAAC GGGCCG 2246

SEQ ID NO: 18
SEQUENCE LENGTH: 535
SEQUENCE TYPE: amino acid
TOPOLOGY: linear
MOLECULE TYPE: peptide



SEQUENCE DESCRIPTION 

Met Arg Leu Arg Asn Gly Thr Val Ala Thr Val Leu Ala Phe Ile Thr
1 5 10 15

Ser Phe Leu Thr Leu Ser Trp Tyr Thr Thr Trp Gln Asn Gly Lys Glu
20 25 30

Lys Val Ile Ala Tyr Gln Arg Glu Phe Leu Ala Leu Lys Glu Arg Leu
35 40 45

Arg Ile Ala Glu His Arg Ile Ser Gln Arg Ser Ser Glu Leu Ser Ala
50 55 60

Ile Val Gln Gln Phe Lys Arg Val Glu Ala Glu Thr Asn Arg Ser Lys
65 70 75 80

Asp Pro Val Asn Lys Phe Ser Asp Asp Thr Leu Lys Ile Leu Lys Glu
85 90 95

Leu Thr Ser Lys Lys Ser Leu Gln Val Pro Ser Ile Tyr Tyr His Leu
100 105 110

Pro His Leu Leu Gln Asn Glu Gly Ser Leu Gln Pro Ala Val Gln Ile
115 120 125

Gly Asn Gly Arg Thr Gly Val Ser Ile Val Met Gly Ile Pro Thr Val
130 135 140

Lys Arg Glu Val Lys Ser Tyr Leu Ile Glu Thr Leu His Ser Leu Ile
145 150 155 160

Asp Asn Leu Tyr Pro Glu Glu Lys Leu Asp Cys Val Ile Val Val Phe
165 170 175

Ile Gly Glu Thr Asp Thr Asp Tyr Val Asn Gly Val Val Ala Asn Leu
180 185 190

Glu Lys Glu Phe Ser Lys Glu Ile Ser Ser Gly Leu Val Glu Ile Ile
195 200 205

Ser Pro Pro Glu Ser Tyr Tyr Pro Asp Leu Thr Asn Leu Lys Glu Thr
210 215 220

Phe Gly Asp Ser Lys Glu Arg Val Arg Trp Arg Thr Lys Gln Asn Leu
225 230 235 240

Asp Tyr Cys Phe Leu Met Met Tyr Ala Gln Glu Lys Gly Thr Tyr Tyr
245 250 255

Ile Gln Leu Glu Asp Asp Ile Ile Val Lys Gln Asn Tyr Phe Asn Thr
260 265 270

Ile Lys Asn Phe Ala Leu Gln Leu Ser Ser Glu Glu Trp Met Ile Leu
275 280 285

Glu Phe Ser Gln Leu Gly Phe Ile Gly Lys Met Phe Gln Ala Pro Asp
290 295 300

Leu Thr Leu Ile Val Glu Phe Ile Phe Met Phe Tyr Lys Glu Lys Pro
305 310 315 320

Ile Asp Trp Leu Leu Asp His Ile Leu Trp Val Lys Val Cys Asn Pro
325 330 335

Glu Lys Asp Ala Lys His Cys Asp Arg Gln Lys Ala Asn Leu Arg Ile
340 345 350

Arg Phe Arg Pro Ser Leu Phe Gln His Val Gly Leu His Ser Ser Leu
355 360 365

Thr Gly Lys Ile Gln Lys Leu Thr Asp Lys Asp Tyr Met Lys Pro Leu
370 375 380

Leu Leu Lys Ile His Val Asn Pro Pro Ala Glu Val Ser Thr Ser Leu
385 390 395 400
Lys Val Tyr Gln Gly His Thr Leu Glu Lys Thr Tyr Met Gly Glu Asp
405 410 415

Phe Phe Trp Ala Ile Thr Pro Val Ala Gly Asp Tyr Ile Leu Phe Lys
420 425 430

Phe Asp Lys Pro Val Asn Val Glu Ser Tyr Leu Phe His Ser Gly Asn
435 440 445

Gln Asp His Pro Gly Asp Ile Leu Leu Asn Thr Thr Val Glu Val Leu
450 455 460

Pro Leu Lys Ser Glu Gly Leu Asp Ile Ser Lys Glu Thr Lys Asp Lys
465 470 475 480

Arg Leu Glu Asp Gly Tyr Phe Arg Ile Gly Lys Phe Glu Asn Gly Val
485 490 495

Ala Glu Gly Met Val Asp Pro Ser Leu Asn Pro Ile Ser Ala Phe Arg
500 505 510

Leu Ser Val Ile Gln Asn Ser Ala Val Trp Ala Ile Leu Asn Glu Ile
515 520 525

His Ile Lys Lys Val Thr Asn
530 535



SEQ ID NO: 19 

SEQUENCE LENGTH: 31 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS : single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 



SEQUENCE DESCRIPTION

CCCTCGAG ATG AGG CTC CGA AAT GGA ACT GT 31
Met Arg Leu Arg Asn Gly Thr Val

SEQ ID NO: 20
SEQUENCE LENGTH: 31
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear
MOLECULE TYPE: synthetic DNA
ANTISENSE: Yes

SEQUENCE DESCRIPTION

TTTCTAGA TCA GTT TGT GAC TTT TTT AAT AT
TRM Asn Thr Val Lys Lys Ile His

SEQ ID NO: 21 

SEQUENCE LENGTH: 24 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION 
ACGATTGTGC AACAGTTCAA GCGT 24 

SEQ ID NO: 22 

SEQUENCE LENGTH: 24 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single

TOPOLOGY: linear

MOLECULE TYPE: synthetic DNA

SEQUENCE DESCRIPTION 
GGGAGAACTC CAGGATCATC CAGT 24 

SEQ ID NO: 23
SEQUENCE LENGTH: 2115
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: linear
MOLECULE TYPE: cDNA

SEQUENCE DESCRIPTION

GAAATGAACC TCTCTTATTG ATTTTTATTG GCCTAGAGCC AGGAGTACTG CATTCAGTTG 60
ACTTTCAGGG TAAAAAGAAA ACAGTCCTGG TTGTTGTCAT CATAAACATA TGGACCAGTG 120
TGATGGTGAA ATGAG ATG AGG CTC CGC AAT GGA ACT GTA GCC ACT GCT TTA 171
Met Arg Leu Arg Asn Gly Thr Val Ala Thr Ala Leu
1 5 10

GCA TTT ATC ACT TCC TTC CTT ACT TTG TCT TGG TAT ACT ACA TGG CAA 219
Ala Phe Ile Thr Ser Phe Leu Thr Leu Ser Trp Tyr Thr Thr Trp Gln
15 20 25

AAT GGG AAA GAA AAA CTG ATT GCT TAT CAA CGA GAA TTC CTT GCT TTG 267
Asn Gly Lys Glu Lys Leu Ile Ala Tyr Gln Arg Glu Phe Leu Ala Leu
30 35 40

AAA GAA CGT CTT CGA ATA GCT GAA CAC AGA ATC TCA CAG CGC TCT TCT 315
Lys Glu Arg Leu Arg Ile Ala Glu His Arg Ile Ser Gln Arg Ser Ser
45 50 55 60

GAA TTA AAT ACG ATT GTG CAA CAG TTC AAG CGT GTA GGA GCA GAA ACA 363
Glu Leu Asn Thr Ile Val Gln Gln Phe Lys Arg Val Gly Ala Glu Thr
65 70 75

AAT GGA AGT AAG GAT GCG TTG AAT AAG TTT TCA GAT AAT ACC CTA AAG 411
Asn Gly Ser Lys Asp Ala Leu Asn Lys Phe Ser Asp Asn Thr Leu Lys
80 85 90

CTG TTA AAG GAG TTA ACA AGC AAA AAA TCT CTT CAA GTG CCA AGT ATT 459
Leu Leu Lys Glu Leu Thr Ser Lys Lys Ser Leu Gln Val Pro Ser Ile
95 100 105

TAT TAT CAT TTG CCT CAT TTA TTG AAA AAT GAA GGA AGT CTT CAA CCT 507
Tyr Tyr His Leu Pro His Leu Leu Lys Asn Glu Gly Ser Leu Gln Pro
110 115 120

GCT GTA CAG ATT GGC AAC GGA AGA ACA GGA GTT TCA ATA GTC ATG GGC 555
Ala Val Gln Ile Gly Asn Gly Arg Thr Gly Val Ser Ile Val Met Gly
125 130 135 140

ATT CCC ACA GTG AAG AGA GAA GTT AAA TCT TAC CTC ATA GAA ACT CTT 603
Ile Pro Thr Val Lys Arg Glu Val Lys Ser Tyr Leu Ile Glu Thr Leu
145 150 155

CAT TCC CTT ATT GAT AAC CTG TAT CCT GAA GAG AAG TTG GAC TGT GTT 651
His Ser Leu Ile Asp Asn Leu Tyr Pro Glu Glu Lys Leu Asp Cys Val
160 165 170

ATA GTA GTC TTC ATA GGA GAG ACA GAT ATT GAT TAT GTA CAT GGT GTT 699
Ile Val Val Phe Ile Gly Glu Thr Asp Ile Asp Tyr Val His Gly Val
175 180 185

GTA GCC AAC CTG GAG AAA GAA TTT TCT AAA GAA ATC AGT TCT GGC TTG 747
Val Ala Asn Leu Glu Lys Glu Phe Ser Lys Glu Ile Ser Ser Gly Leu
190 195 200

GTG GAA GTC ATA TCA CCC CCT GAA AGC TAT TAT CCT GAC TTG ACA AAC 795
Val Glu Val Ile Ser Pro Pro Glu Ser Tyr Tyr Pro Asp Leu Thr Asn
205 210 215 220

CTA AAG GAG ACA TTT GGA GAC TCC AAA GAA AGA GTA AGA TGG AGA ACA 843
Leu Lys Glu Thr Phe Gly Asp Ser Lys Glu Arg Val Arg Trp Arg Thr
225 230 235

AAG CAA AAC CTA GAT TAC TGT TTT CTA ATG ATG TAT GCT CAA GAA AAG 891
Lys Gln Asn Leu Asp Tyr Cys Phe Leu Met Met Tyr Ala Gln Glu Lys
240 245 250

GCC ATA TAT TAC ATT CAG CTT GAA GAT GAT ATT ATT GTC AAA CAA AAT 939
Gly Ile Tyr Tyr Ile Gln Leu Glu Asp Asp Ile Ile Val Lys Gln Asn
255 260 265

TAT TTT AAT ACC ATA AAA AAT TTT GCA CTT CAA CTT TCT TCT GAG GAA 987
Tyr Phe Asn Thr Ile Lys Asn Phe Ala Leu Gln Leu Ser Ser Glu Glu
270 275 280

TGG ATG ATT CTA GAG TTT TCC CAG CTG GGC TTC ATT GGT AAA ATG TTT 1035
Trp Met Ile Leu Glu Phe Ser Gln Leu Gly Phe Ile Gly Lys Met Phe
285 290 295 300

CAA GCG CCG GAT CTT ACT CTG ATT GTA GAA TTC ATA TTC ATG TTT TAC 1083
Gln Ala Pro Asp Leu Thr Leu Ile Val Glu Phe Ile Phe Met Phe Tyr
305 310 315

AAG GAG AAA CCC ATT GAT TGG CTC CTG GAC CAT ATT CTC TGG GTG AAA 1131
Lys Glu Lys Pro Ile Asp Trp Leu Leu Asp His Ile Leu Trp Val Lys
320 325 330

GTC TGC AAC CCT GAA AAA GAT GCA AAA CAT TGT GAT AGA CAG AAA GCA 1179
Val Cys Asn Pro Glu Lys Asp Ala Lys His Cys Asp Arg Gln Lys Ala
335 340 345

AAT CTG CGA ATT CGC TTC AGA CCT TCC CTT TTC CAA CAT GTT GGT CTG 1227
Asn Leu Arg Ile Arg Phe Arg Pro Ser Leu Phe Gln His Val Gly Leu
350 355 360

CAC TCA TCA CTA TCA GGA AAA ATC CAA AAA CTC ACG GAT AAA GAT TAT 1275
His Ser Ser Leu Ser Gly Lys Ile Gln Lys Leu Thr Asp Lys Asp Tyr
365 370 375 380

ATG AAA CCA TTA CTT CTT AAA ATC CAT GTA AAC CCA CCT GCG GAG GTA 1323
Met Lys Pro Leu Leu Leu Lys Ile His Val Asn Pro Pro Ala Glu Val
385 390 395

TCT ACT TCC TTG AAG GTC TAC CAA GGG CAT ACG CTG GAG AAA ACT TAC 1371
Ser Thr Ser Leu Lys Val Tyr Gln Gly His Thr Leu Glu Lys Thr Tyr
400 405 410

ATG GGA GAG GAT TTC TTC TGG GCT ATC ACA CCG ATA GCT GGA GAC TAC 1419
Met Gly Glu Asp Phe Phe Trp Ala Ile Thr Pro Ile Ala Gly Asp Tyr
415 420 425

ATC TTG TTT AAA TTT GAT AAA CCA GTC AAT GTA GAA AGT TAT TTG TTC 1467
Ile Leu Phe Lys Phe Asp Lys Pro Val Asn Val Glu Ser Tyr Leu Phe
430 435 440

CAT AGC GGC AAC CAA GAA CAT CCT GGA GAT ATT CTG CTA AAC ACA ACT 1515
His Ser Gly Asn Gln Glu His Pro Gly Asp Ile Leu Leu Asn Thr Thr
445 450 455 460

GTG GAA GTT TTG CCT TTT AAG AGT GAA GGT TTG GAA ATA AGC AAA GAA 1563
Val Glu Val Leu Pro Phe Lys Ser Glu Gly Leu Glu Ile Ser Lys Glu
465 470 475

ACC AAA GAC AAA CGA TTA GAA GAT GGC TAT TTC AGA ATA GGA AAA TTT 1611
Thr Lys Asp Lys Arg Leu Glu Asp Gly Tyr Phe Arg Ile Gly Lys Phe
480 485 490

GAG AAT GGT GTT GCA GAA GGA ATG GTG GAT CCA AGT CTC AAT CCC ATT 1659
Glu Asn Gly Val Ala Glu Gly Met Val Asp Pro Ser Leu Asn Pro Ile
495 500 505

TCA GCC TTT CGA CTT TCA GTT ATT CAG AAT TCT GCT GTT TGG GCC ATT 1707
Ser Ala Phe Arg Leu Ser Val Ile Gln Asn Ser Ala Val Trp Ala Ile
510 515 520

CTT AAT GAG ATT CAT ATT AAA AAA GCC ACC AAC TGATCATCTG AGAAACCAAC 1760
Leu Asn Glu Ile His Ile Lys Lys Ala Thr Asn
525 530 535

ACATTTTTTC CTGTGAATTT GTTAATTAAA GATAGTTAAG CATGTATCTT TTTTTTATTT 1820
CTACTTGAAC ACTACCTCTT GTGAAGTCTA CTGTAGATAA GACGATTGTC ATTTCCACTT 1880
GGAAAGTGAA TCTCCCATAA TAATTGTATT TGTTTGAAAC TAAGCTGTCC TCAGATTTTA 1940
ACTTGACTCA AACATTTTTC AATTATGACA GCCTGTTAAT ATGACTTGTA CTATTTTGGT 2000
ATTATACTAA TACATAAGAG TTGTACATAT TGTTACATTC TTTAAATTTG AGAAAAACTA 2060
ATGTTACATA CATTTTATGA AGGGGGTACT TTTGAGGTTC ACTTATTTTA CTATT 2115

SEQ ID NO: 24 
SEQUENCE LENGTH: 535 
SEQUENCE TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 

SEQUENCE DESCRIPTION 

Met Arg Leu Arg Asn Gly Thr Val Ala Thr Ala Leu Ala Phe Ile Thr
1 5 10 15

Ser Phe Leu Thr Leu Ser Trp Tyr Thr Thr Trp Gln Asn Gly Lys Glu
20 25 30

Lys Leu Ile Ala Tyr Gln Arg Glu Phe Leu Ala Leu Lys Glu Arg Leu
35 40 45

Arg Ile Ala Glu His Arg Ile Ser Gln Arg Ser Ser Glu Leu Asn Thr
50 55 60

Ile Val Gln Gln Phe Lys Arg Val Gly Ala Glu Thr Asn Gly Ser Lys
65 70 75 80

Asp Ala Leu Asn Lys Phe Ser Asp Asn Thr Leu Lys Leu Leu Lys Glu
85 90 95

Leu Thr Ser Lys Lys Ser Leu Gln Val Pro Ser Ile Tyr Tyr His Leu
100 105 110

Pro His Leu Leu Lys Asn Glu Gly Ser Leu Gln Pro Ala Val Gln Ile
115 120 125

Gly Asn Gly Arg Thr Gly Val Ser Ile Val Met Gly Ile Pro Thr Val
130 135 140

Lys Arg Glu Val Lys Ser Tyr Leu Ile Glu Thr Leu His Ser Leu Ile
145 150 155 160

Asp Asn Leu Tyr Pro Glu Glu Lys Leu Asp Cys Val Ile Val Val Phe
165 170 175

Ile Gly Glu Thr Asp Ile Asp Tyr Val His Gly Val Val Ala Asn Leu
180 185 190

Glu Lys Glu Phe Ser Lys Glu Ile Ser Ser Gly Leu Val Glu Val Ile
195 200 205

Ser Pro Pro Glu Ser Tyr Tyr Pro Asp Leu Thr Asn Leu Lys Glu Thr
210 215 220

Phe Gly Asp Ser Lys Glu Arg Val Arg Trp Arg Thr Lys Gln Asn Leu
225 230 235 240

Asp Tyr Cys Phe Leu Met Met Tyr Ala Gln Glu Lys Gly Ile Tyr Tyr
245 250 255

Ile Gln Leu Glu Asp Asp Ile Ile Val Lys Gln Asn Tyr Phe Asn Thr
260 265 270

Ile Lys Asn Phe Ala Leu Gln Leu Ser Ser Glu Glu Trp Met Ile Leu
275 280 285

Glu Phe Ser Gln Leu Gly Phe Ile Gly Lys Met Phe Gln Ala Pro Asp
290 295 300

Leu Thr Leu Ile Val Glu Phe Ile Phe Met Phe Tyr Lys Glu Lys Pro
305 310 315 320

Ile Asp Trp Leu Leu Asp His Ile Leu Trp Val Lys Val Cys Asn Pro
325 330 335

Glu Lys Asp Ala Lys His Cys Asp Arg Gln Lys Ala Asn Leu Arg Ile
340 345 350

Arg Phe Arg Pro Ser Leu Phe Gln His Val Gly Leu His Ser Ser Leu
355 360 365

Ser Gly Lys Ile Gln Lys Leu Thr Asp Lys Asp Tyr Met Lys Pro Leu
370 375 380

Leu Leu Lys Ile His Val Asn Pro Pro Ala Glu Val Ser Thr Ser Leu
385 390 395 400

Lys Val Tyr Gln Gly His Thr Leu Glu Lys Thr Tyr Met Gly Glu Asp
405 410 415

Phe Phe Trp Ala Ile Thr Pro Ile Ala Gly Asp Tyr Ile Leu Phe Lys
420 425 430

Phe Asp Lys Pro Val Asn Val Glu Ser Tyr Leu Phe His Ser Gly Asn
435 440 445

Gln Glu His Pro Gly Asp Ile Leu Leu Asn Thr Thr Val Glu Val Leu
450 455 460

Pro Phe Lys Ser Glu Gly Leu Glu Ile Ser Lys Glu Thr Lys Asp Lys
465 470 475 480

Arg Leu Glu Asp Gly Tyr Phe Arg Ile Gly Lys Phe Glu Asn Gly Val
485 490 495

Ala Glu Gly Met Val Asp Pro Ser Leu Asn Pro Ile Ser Ala Phe Arg
500 505 510

Leu Ser Val Ile Gln Asn Ser Ala Val Trp Ala Ile Leu Asn Glu Ile
515 520 525

His Ile Lys Lys Ala Thr Asn
530 535



SEQ ID NO: 25 
SEQUENCE LENGTH: 30 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single
TOPOLOGY: linear

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION 

TTCTCGAGAT GAGGCTCCGC AATGGAACTG 

SEQ ID NO: 26 

SEQUENCE LENGTH : 24 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS : single 

TOPOLOGY : linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION 
AGAAATGTGG GCTTCAGGGC TGGC 24 

SEQ ID NO: 27 

SEQUENCE LENGTH: 30 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION 

TTCTCGAGAT GAGGCTCCGC AATGGAACTG 

SEQ ID NO: 28 
SEQUENCE LENGTH: 24 
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single

TOPOLOGY: linear

MOLECULE TYPE: synthetic DNA

SEQUENCE DESCRIPTION

AGAAATGTGG GCTTCAGGGC TGGC 24

SEQ ID NO: 29 

SEQUENCE LENGTH: 30 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS : single 

TOPOLOGY: linear

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION 

TTCTCGAGAT GAGGCTCCGC AATGGAACTG 

SEQ ID NO: 30 

SEQUENCE LENGTH: 24

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS : single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION 

AGAAATGTGG GCTTCAGGGC TGGC 24 

SEQ ID NO: 31 
SEQUENCE LENGTH: 25
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear 
MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION 
TTCCATCACC TGCCACACCT GCTGG 25 

SEQ ID NO: 32

SEQUENCE LENGTH: 24 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS : single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION 
ACAACCCTCA GTCAGACAAG GAGG 24 

SEQ ID NO: 33 

SEQUENCE LENGTH: 24 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS : single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION 
ACACCCCCAG AAATGTGGGC TTCA 24 

SEQ ID NO: 34
SEQUENCE LENGTH: 24
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear
MOLECULE TYPE: synthetic DNA

SEQUENCE DESCRIPTION
ATGACCGAGT CCTCCTTCTC CTGC 24


SEQ ID NO: 35
SEQUENCE LENGTH: 24
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear
MOLECULE TYPE: synthetic DNA

SEQUENCE DESCRIPTION
ATGCCCATCA CCACCGACAC TCCG 24


SEQ ID NO: 36
SEQUENCE LENGTH: 1724
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: linear
MOLECULE TYPE: cDNA

SEQUENCE DESCRIPTION
TGCAGCCrCG GCCCCGCGGG CGCCCGCCGC GCACCCGACG AG ATG AGG CTC CGC 54
Met Arg Leu Arg
1

AAT GGC ACC TTC CTG ACG CTG CTG CTC TTC TGC CTG TGC GCC TTC CTC 102
Asn Gly Thr Phe Leu Thr Leu Leu Leu Phe Cys Leu Cys Ala Phe Leu
5 10 15 20

TCG CTG TCC TGG TAC GCG GCA CTC AGC GGC CAG AAA GGC GAC GTT GTG 150
Ser Leu Ser Trp Tyr Ala Ala Leu Ser Gly Gln Lys Gly Asp Val Val
25 30 35

GAC GTT TAC CAG CGG GAG TTC CTG GCG CTG CGC GAT CGG TTG CAC GCA 198
Asp Val Tyr Gln Arg Glu Phe Leu Ala Leu Arg Asp Arg Leu His Ala
40 45 50

GCT GAG CAG GAG AGC CTC AAG CGC TCC AAG GAG CTC AAC CTG GTG CTG 246
Ala Glu Gln Glu Ser Leu Lys Arg Ser Lys Glu Leu Asn Leu Val Leu
55 60 65

GAC GAG ATC AAG AGG GCC GTG TCA GAA AGG CAG GCG CTG CGA GAC GGA 294
Asp Glu Ile Lys Arg Ala Val Ser Glu Arg Gln Ala Leu Arg Asp Gly
70 75 80

GAC GGC AAT CGC ACC TGG GGC CGC CTA ACA GAG GAC CCC CGA TTG AAG 342
Asp Gly Asn Arg Thr Trp Gly Arg Leu Thr Glu Asp Pro Arg Leu Lys
85 90 95 100

CCG TGG AAC GGC TCA CAC CGG CAC GTG CTG CAC CTG CCC ACC GTC TTC 390
Pro Trp Asn Gly Ser His Arg His Val Leu His Leu Pro Thr Val Phe
105 110 115

CAT CAC CTG CCA CAC CTG CTG GCC AAG GAG AGC AGT CTG CAG CCC GCG 438
His His Leu Pro His Leu Leu Ala Lys Glu Ser Ser Leu Gln Pro Ala
120 125 130

GTG CGC GTG GGC CAG GGC CGC ACC GGA GTG TCG GTG GTG ATG GGC ATC 486
Val Arg Val Gly Gln Gly Arg Thr Gly Val Ser Val Val Met Gly Ile
135 140 145

CCG AGC GTG CGG CGC GAG GTG CAC TCG TAC CTG ACT GAC ACT CTG CAC 534
Pro Ser Val Arg Arg Glu Val His Ser Tyr Leu Thr Asp Thr Leu His
150 155 160

TCG CTC ATC TCC GAG CTG AGC CCG CAG GAG AAG GAG GAC TCG GTC ATC 582
Ser Leu Ile Ser Glu Leu Ser Pro Gln Glu Lys Glu Asp Ser Val Ile
165 170 175 180

GTG GTG CTG ATC GCC GAG ACT GAC TCA CAG TAC ACT TCG GCA GTG ACA 630
Val Val Leu Ile Ala Glu Thr Asp Ser Gln Tyr Thr Ser Ala Val Thr
185 190 195

GAG AAC ATC AAG GCC TTG TTC CCC ACG GAG ATC CAT TCT GGG CTC CTG 678
Glu Asn Ile Lys Ala Leu Phe Pro Thr Glu Ile His Ser Gly Leu Leu
200 205 210

GAG GTC ATC TCA CCC TCC CCC CAC TTC TAC CCT GAC TTC TCC CGC CTC 726
Glu Val Ile Ser Pro Ser Pro His Phe Tyr Pro Asp Phe Ser Arg Leu
215 220 225

CGA GAG TCC TTT GGG GAC CCC AAG GAG AGA GTC AGG TGG AGG ACC AAA 774
Arg Glu Ser Phe Gly Asp Pro Lys Glu Arg Val Arg Trp Arg Thr Lys
230 235 240

CAG AAC CTC GAT TAC TGC TTC CTC ATG ATG TAC GCG CAG TCC AAA GGC 822
Gln Asn Leu Asp Tyr Cys Phe Leu Met Met Tyr Ala Gln Ser Lys Gly
245 250 255 260

ATC TAC TAC GTG CAG CTG GAG GAT GAC ATC GTG GCC AAG CCC AAC TAC 870
Ile Tyr Tyr Val Gln Leu Glu Asp Asp Ile Val Ala Lys Pro Asn Tyr
265 270 275

CTG AGC ACC ATG AAG AAC TTT GCA CTG CAG CAG CCT TCA GAG GAC TGG 918
Leu Ser Thr Met Lys Asn Phe Ala Leu Gln Gln Pro Ser Glu Asp Trp
280 285 290

ATG ATC CTG GAG TTC TCC CAG CTG GGC TTC ATT GGT AAG ATG TTC AAG 966
Met Ile Leu Glu Phe Ser Gln Leu Gly Phe Ile Gly Lys Met Phe Lys
295 300 305

TCG CTG GAC CTG AGC CTG ATT GTA GAG TTC ATT CTC ATG TTC TAC CGG 1014
Ser Leu Asp Leu Ser Leu Ile Val Glu Phe Ile Leu Met Phe Tyr Arg
310 315 320

GAC AAG CCC ATC GAC TGG CTC CTG GAC CAT ATT CTG TGG GTG AAA GTC 1062
Asp Lys Pro Ile Asp Trp Leu Leu Asp His Ile Leu Trp Val Lys Val
325 330 335 340

TGC AAC CCC GAG AAG GAT GCG AAG CAC TGT GAC CGG CAG AAA GCC AAC 1110
Cys Asn Pro Glu Lys Asp Ala Lys His Cys Asp Arg Gln Lys Ala Asn
345 350 355

CTG CGG ATC CGC TTC AAA CCG TCC CTC TTC CAG CAC GTG GGC ACT CAC 1158
Leu Arg Ile Arg Phe Lys Pro Ser Leu Phe Gln His Val Gly Thr His
360 365 370

TCC TCG CTG GCT GGC AAG ATC CAG AAA CTG AAG GAC AAA GAC TTT GGA 1206
Ser Ser Leu Ala Gly Lys Ile Gln Lys Leu Lys Asp Lys Asp Phe Gly
375 380 385

AAG CAG GCG CTG CGG AAG GAG CAT GTG AAC CCG CCA GCA GAG GTG AGC 1254
Lys Gln Ala Leu Arg Lys Glu His Val Asn Pro Pro Ala Glu Val Ser
390 395 400

ACG AGC CTG AAG ACA TAC CAG CAC TTC ACC CTG GAG AAA GCC TAC CTG 1302
Thr Ser Leu Lys Thr Tyr Gln His Phe Thr Leu Glu Lys Ala Tyr Leu
405 410 415 420

CGC GAG GAC TTC TTC TGG GCC TTC ACC CCT GCC GCG GGG GAC TTC ATC 1350
Arg Glu Asp Phe Phe Trp Ala Phe Thr Pro Ala Ala Gly Asp Phe Ile
425 430 435

CGC TTC CGC TTC TTC CAA CCT CTA AGA CTG GAG CGG TTC TTC TTC CGC 1398
Arg Phe Arg Phe Phe Gln Pro Leu Arg Leu Glu Arg Phe Phe Phe Arg
440 445 450

AGT GGG AAC ATC GAG CAC CCG GAG GAC AAG CTC TTC AAC ACG TCT GTG 1446
Ser Gly Asn Ile Glu His Pro Glu Asp Lys Leu Phe Asn Thr Ser Val
455 460 465

GAG GTG CTG CCC TTC GAC AAC CCT CAG TCA GAC AAG GAG GCC CTG CAG 1494
Glu Val Leu Pro Phe Asp Asn Pro Gln Ser Asp Lys Glu Ala Leu Gln
470 475 480

GAG GGC CGC ACC GCC ACC CTC CGG TAC CCT CGG AGC CCC GAC GGC TAC 1542
Glu Gly Arg Thr Ala Thr Leu Arg Tyr Pro Arg Ser Pro Asp Gly Tyr
485 490 495 500

CTC CAG ATC GGC TCC TTC TAC AAG GGA GTG GCA GAG GGA GAG GTG GAC 1590
Leu Gln Ile Gly Ser Phe Tyr Lys Gly Val Ala Glu Gly Glu Val Asp
505 510 515

CCA GCC TTC GGC CCT CTG GAA GCA CTG CGC CTC TCG ATC CAG ACG GAC 1638
Pro Ala Phe Gly Pro Leu Glu Ala Leu Arg Leu Ser Ile Gln Thr Asp
520 525 530

TCC CCT GTG TGG GTG ATT CTG AGC GAG ATC TTC CTG AAA AAG GCC GAC 1686
Ser Pro Val Trp Val Ile Leu Ser Glu Ile Phe Leu Lys Lys Ala Asp
535 540 545 548

TAAGCTGCGG GCTTCTGAGG GTACCCTGTG GCCAGCCC 1724 
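
The position numbers in the SEQ ID NO: 36 listing can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the original filing; the constant names are hypothetical, and the values are read off the listing above (42 bases printed before the first ATG, 548 residues, and the final row beginning with the TAA stop codon).

```python
# Feature coordinates read off the SEQ ID NO: 36 listing (1-based, inclusive).
TOTAL_LENGTH = 1724     # SEQUENCE LENGTH field of SEQ ID NO: 36
UTR5_LENGTH = 42        # four blocks of 10 bases plus "AG" before the first ATG
PROTEIN_LENGTH = 548    # last residue number printed under the final codon row
# Final row of the listing: TAA stop codon followed by 3' untranslated bases.
TAIL_BLOCK = "TAAGCTGCGG GCTTCTGAGG GTACCCTGTG GCCAGCCC"


def cds_coordinates(utr5_length, protein_length):
    """Return (start, end) of the coding region, excluding the stop codon."""
    start = utr5_length + 1
    end = utr5_length + 3 * protein_length
    return start, end


cds_start, cds_end = cds_coordinates(UTR5_LENGTH, PROTEIN_LENGTH)
tail_length = len(TAIL_BLOCK.replace(" ", ""))

print("coding region:", cds_start, "to", cds_end)
print("total length from features:", cds_end + tail_length)
```

Under these assumptions the last sense codon ends at the position printed beside the final codon row, and adding the stop codon plus trailing bases reproduces the declared sequence length.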



SEQ ID NO: 37
SEQUENCE LENGTH: 548 
SEQUENCE TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 



SEQUENCE DESCRIPTION 

Met Arg Leu Arg Asn Gly Thr Phe Leu Thr Leu Leu Leu Phe Cys Leu
1 5 10 15

Cys Ala Phe Leu Ser Leu Ser Trp Tyr Ala Ala Leu Ser Gly Gln Lys
20 25 30

Gly Asp Val Val Asp Val Tyr Gln Arg Glu Phe Leu Ala Leu Arg Asp
35 40 45

Arg Leu His Ala Ala Glu Gln Glu Ser Leu Lys Arg Ser Lys Glu Leu
50 55 60

Asn Leu Val Leu Asp Glu Ile Lys Arg Ala Val Ser Glu Arg Gln Ala
65 70 75 80

Leu Arg Asp Gly Asp Gly Asn Arg Thr Trp Gly Arg Leu Thr Glu Asp
85 90 95

Pro Arg Leu Lys Pro Trp Asn Gly Ser His Arg His Val Leu His Leu
100 105 110

Pro Thr Val Phe His His Leu Pro His Leu Leu Ala Lys Glu Ser Ser
115 120 125

Leu Gln Pro Ala Val Arg Val Gly Gln Gly Arg Thr Gly Val Ser Val
130 135 140

Val Met Gly Ile Pro Ser Val Arg Arg Glu Val His Ser Tyr Leu Thr
145 150 155 160

Asp Thr Leu His Ser Leu Ile Ser Glu Leu Ser Pro Gln Glu Lys Glu
165 170 175

Asp Ser Val Ile Val Val Leu Ile Ala Glu Thr Asp Ser Gln Tyr Thr
180 185 190

Ser Ala Val Thr Glu Asn Ile Lys Ala Leu Phe Pro Thr Glu Ile His
195 200 205

Ser Gly Leu Leu Glu Val Ile Ser Pro Ser Pro His Phe Tyr Pro Asp
210 215 220

Phe Ser Arg Leu Arg Glu Ser Phe Gly Asp Pro Lys Glu Arg Val Arg
225 230 235 240

Trp Arg Thr Lys Gln Asn Leu Asp Tyr Cys Phe Leu Met Met Tyr Ala
245 250 255

Gln Ser Lys Gly Ile Tyr Tyr Val Gln Leu Glu Asp Asp Ile Val Ala
260 265 270

Lys Pro Asn Tyr Leu Ser Thr Met Lys Asn Phe Ala Leu Gln Gln Pro
275 280 285

Ser Glu Asp Trp Met Ile Leu Glu Phe Ser Gln Leu Gly Phe Ile Gly
290 295 300

Lys Met Phe Lys Ser Leu Asp Leu Ser Leu Ile Val Glu Phe Ile Leu
305 310 315 320

Met Phe Tyr Arg Asp Lys Pro Ile Asp Trp Leu Leu Asp His Ile Leu
325 330 335

Trp Val Lys Val Cys Asn Pro Glu Lys Asp Ala Lys His Cys Asp Arg
340 345 350

Gln Lys Ala Asn Leu Arg Ile Arg Phe Lys Pro Ser Leu Phe Gln His
355 360 365

Val Gly Thr His Ser Ser Leu Ala Gly Lys Ile Gln Lys Leu Lys Asp
370 375 380

Lys Asp Phe Gly Lys Gln Ala Leu Arg Lys Glu His Val Asn Pro Pro
385 390 395 400

Ala Glu Val Ser Thr Ser Leu Lys Thr Tyr Gln His Phe Thr Leu Glu
405 410 415

Lys Ala Tyr Leu Arg Glu Asp Phe Phe Trp Ala Phe Thr Pro Ala Ala
420 425 430

Gly Asp Phe Ile Arg Phe Arg Phe Phe Gln Pro Leu Arg Leu Glu Arg
435 440 445

Phe Phe Phe Arg Ser Gly Asn Ile Glu His Pro Glu Asp Lys Leu Phe
450 455 460

Asn Thr Ser Val Glu Val Leu Pro Phe Asp Asn Pro Gln Ser Asp Lys
465 470 475 480

Glu Ala Leu Gln Glu Gly Arg Thr Ala Thr Leu Arg Tyr Pro Arg Ser
485 490 495

Pro Asp Gly Tyr Leu Gln Ile Gly Ser Phe Tyr Lys Gly Val Ala Glu
500 505 510

Gly Glu Val Asp Pro Ala Phe Gly Pro Leu Glu Ala Leu Arg Leu Ser
515 520 525

Ile Gln Thr Asp Ser Pro Val Trp Val Ile Leu Ser Glu Ile Phe Leu
530 535 540

Lys Lys Ala Asp
545 548

SEQ ID NO: 38

SEQUENCE LENGTH: 29

SEQUENCE TYPE: nucleic acid

STRANDEDNESS: single

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION 

TTCTCGAGGA GATGAGGCTC CGCAATGGC 29 

SEQ ID NO: 39

SEQUENCE LENGTH: 30 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION 

AATCTAGAAA TGTGGGCTTC AGGGCTGGC 30

SEQ ID NO: 40

SEQUENCE LENGTH: 28 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS : single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION 

CCCTCGAG ATG GGG GTG CAC GAA TGT CC 28 
Met Gly Val His Glu Cys Pro 

SEQ ID NO: 41 

SEQUENCE LENGTH: 48 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS : single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

ANTISENSE: Yes 

SEQUENCE DESCRIPTION 

CTTTTTGCTT GTTAACTCCT TTAGTATTGG GGCGCCCAGG ACTGGGAG 

SEQ ID NO: 42 

SEQUENCE LENGTH: 48 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS : single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA

SEQUENCE DESCRIPTION

CTCCCAGTCC TGGGCGCCCC AATACTAAAG GAGTTAACAA GCAAAAAG 
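
SEQ ID NO: 41 is flagged ANTISENSE and SEQ ID NO: 42 is not, which suggests the two 48-mers are a complementary primer pair. The following is a minimal sketch for checking that relationship, not part of the original filing; the function name `reverse_complement` is an illustrative choice, and the two strings are copied from the listing above.

```python
# Map each DNA base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement, ignoring the block spacing of the listing."""
    bases = seq.replace(" ", "").upper()
    return bases.translate(COMPLEMENT)[::-1]


SEQ_41 = "CTTTTTGCTT GTTAACTCCT TTAGTATTGG GGCGCCCAGG ACTGGGAG"  # antisense
SEQ_42 = "CTCCCAGTCC TGGGCGCCCC AATACTAAAG GAGTTAACAA GCAAAAAG"  # sense

seq_41_bases = SEQ_41.replace(" ", "")
is_complementary_pair = reverse_complement(SEQ_42) == seq_41_bases

print("length of SEQ ID NO: 41:", len(seq_41_bases))
print("SEQ ID NO: 41 is the reverse complement of SEQ ID NO: 42:", is_complementary_pair)
```

The same check can be applied to any other sense/antisense pair in the listing by substituting the two strings.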

SEQ ID NO: 43 

SEQUENCE LENGTH: 44 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear

MOLECULE TYPE: synthetic DNA 

ANTISENSE: Yes 

SEQUENCE DESCRIPTION 

CTTCCTTCAT TTTGCAATAA ATGAGGGGCG CCCAGGACTG GGAG 44 

SEQ ID NO: 44 

SEQUENCE LENGTH: 43 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

ANTISENSE: Yes 

SEQUENCE DESCRIPTION 

ATTTAACTTC TCTCTTCACT GTAGGGGCGC CCAGGACTGG GAG 43 

SEQ ID NO: 45 
SEQUENCE LENGTH: 44 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single
TOPOLOGY: linear

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION 

CTCCCAGTCC TGGGCGCCCC TCATTTATTG CAAAATGAAG GAAG 

SEQ ID NO: 46 

SEQUENCE LENGTH: 43 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS : single 

TOPOLOGY: linear

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION 

CTCCCAGTCC TGGGCGCCCC TACAGTGAAG AGAGAAGTTA AAT 

SEQ ID NO: 47

SEQUENCE LENGTH: 26 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS : single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

ANTISENSE: Yes 

SEQUENCE DESCRIPTION 

TTCTAGAA TCA CCC TTC CGC AAC ACC 26 
TRM Gly Glu Ala Val Gly
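
The annotation under an ANTISENSE primer lists, codon by codon, the residue (or TRM, the termination codon) encoded on the complementary strand. The sketch below illustrates that reading for SEQ ID NO: 47 using the standard genetic code; it is not part of the original filing. The helper names are hypothetical, and treating the first eight bases (TTCTAGAA) as a non-coding 5' extension is an assumption based on the spacing in the listing.

```python
# Standard genetic code, built in TCAG order (first, second, third base).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# One-letter to three-letter names; "*" is written TRM as in the listing.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys", "Q": "Gln",
    "E": "Glu", "G": "Gly", "H": "His", "I": "Ile", "L": "Leu", "K": "Lys",
    "M": "Met", "F": "Phe", "P": "Pro", "S": "Ser", "T": "Thr", "W": "Trp",
    "Y": "Tyr", "V": "Val", "*": "TRM",
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA string, ignoring spaces."""
    return seq.replace(" ", "").upper().translate(COMPLEMENT)[::-1]


def translate(seq):
    """Translate complete codons of a sense-strand sequence to three-letter names."""
    usable = len(seq) - len(seq) % 3
    return [THREE_LETTER[CODON_TABLE[seq[i:i + 3]]] for i in range(0, usable, 3)]


PRIMER_47 = "TTCTAGAA TCA CCC TTC CGC AAC ACC"
EXTENSION_LENGTH = 8  # assumption: TTCTAGAA is a non-coding 5' extension

coding_part = PRIMER_47.replace(" ", "")[EXTENSION_LENGTH:]
sense_residues = translate(reverse_complement(coding_part))

print("sense-strand order:       ", " ".join(sense_residues))
print("order printed under primer:", " ".join(reversed(sense_residues)))
```

Reversing the translated residues gives the order in which they appear under the antisense primer, because the primer is written 5' to 3' on the opposite strand.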

SEQ ID NO: 48
SEQUENCE LENGTH: 28
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear 
MOLECULE TYPE: synthetic DNA 
ANTISENSE: Yes 

SEQUENCE DESCRIPTION 

TTCTAGAA TCA AGG CAG AAC TTC CAC CG 28 
TRM Pro Leu Val Glu Val Thr 

SEQ ID NO: 49 

SEQUENCE LENGTH: 38 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single

TOPOLOGY: linear

MOLECULE TYPE: synthetic DNA 

ANTISENSE: Yes 

SEQUENCE DESCRIPTION 

TTCTAGAA TCA TTT AAA TAG GAT GTA GTC TCC AGC TAC 
TRM Lys Phe Leu Ile Tyr Asp Gly Ala Val

SEQ ID NO: 50 

SEQUENCE LENGTH: 34

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA

ANTISENSE: Yes

SEQUENCE DESCRIPTION

TTCTAGAA TCA TGG TTT CAT GTA ATC TTT ATC CG 
TRM Pro Lys Met Tyr Asp Lys Asp Thr 

SEQ ID NO: 51 

SEQUENCE LENGTH: 19 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS : single 

TOPOLOGY: linear

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION 
TTC ATG AGG CTC CGA AAT G 19 
Met Arg Leu Arg Asn Gly 

SEQ ID NO: 52 

SEQUENCE LENGTH: 29 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS : single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

ANTISENSE: Yes 

SEQUENCE DESCRIPTION 

CAACCT TCA CTT TGT GAC TTT TTT AAT AT 29 

TRM Asn Thr Val Lys Lys Ile His

SEQ ID NO: 53
SEQUENCE LENGTH: 27
SEQUENCE TYPE: nucleic acid

STRANDEDNESS: single
TOPOLOGY: linear

MOLECULE TYPE: synthetic DNA

SEQUENCE DESCRIPTION

TC ATG ATA CTA AAG GAG TTA ACA AGC A 27
Met Ile Leu Lys Glu Leu Thr Ser Lys



SEQ ID NO: 54
SEQUENCE LENGTH: 51
SEQUENCE TYPE: nucleic acid

STRANDEDNESS: single
TOPOLOGY: linear
MOLECULE TYPE: synthetic DNA
ANTISENSE: Yes

SEQUENCE DESCRIPTION

CAAGCT TCA GTG GTG GTG GTG GTG GTG GTT TGT GAC TTT
TRM His His His His His His Asn Thr Val Lys

TTT AAT ATG GAT 51
Lys Ile His Ile



SEQ ID NO: 55
SEQUENCE LENGTH: 18
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear
MOLECULE TYPE: synthetic DNA
ANTISENSE: Yes

SEQUENCE DESCRIPTION
CAAGCT TCA GTG GTG GTG 18
TRM His His His



SEQ ID NO: 56
SEQUENCE LENGTH: 32

SEQUENCE TYPE: nucleic acid

STRANDEDNESS: single
TOPOLOGY: linear

MOLECULE TYPE: synthetic DNA

SEQUENCE DESCRIPTION

TC ATG ATA TTA AAG GAG TTA ACA AGC AAA AAA 32
Met Ile Leu Lys Glu Leu Thr Ser Lys Lys



SEQ ID NO: 57
SEQUENCE LENGTH: 51
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear

MOLECULE TYPE: synthetic DNA
ANTISENSE: Yes

SEQUENCE DESCRIPTION

CAAGCT TCA GTG GTG GTG GTG GTG GTG GTT GGT GGC TTT
TRM His His His His His His Asn Thr Ala Lys

TTT AAT ATG AAT 51
Lys Ile His Ile

SEQ ID NO: 58 

SEQUENCE LENGTH: 49 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single

TOPOLOGY: linear

MOLECULE TYPE: synthetic DNA

ANTISENSE: Yes

SEQUENCE DESCRIPTION 

CAAGCT TCA GTG GTG GTG GTG GTG GTG TGG TTT CAT ATA 
TRM His His His His His His Pro Lys Met Tyr 

ATC TTT ATC C 49 
Asp Lys Asp 

SEQ ID NO: 59 

SEQUENCE LENGTH: 19 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS : single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION
TTC ATG AGG CTC CGC AAT G 19
Met Arg Leu Arg Asn Gly 

SEQ ID NO: 60 

SEQUENCE LENGTH: 18 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS : single 

TOPOLOGY: linear

MOLECULE TYPE: synthetic DNA

ANTISENSE: Yes

SEQUENCE DESCRIPTION
CAGGTTGAGC TCCTTGGA 18 

SEQ ID NO: 61 

SEQUENCE LENGTH: 18 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION 
TCCAAGGAGC TCAACCTG 18 

SEQ ID NO: 62 
SEQUENCE LENGTH: 44 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear
MOLECULE TYPE: synthetic DNA
ANTISENSE: Yes

SEQUENCE DESCRIPTION

CAAGCT TCA GTG GTG GTG GTG GTG GTG GTC GGC CTT TTT CAG GA 44
TRM His His His His His His Asp Ala Lys Lys Leu Phe

SEQ ID NO: 63 

SEQUENCE LENGTH: 23 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS : single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION 
CCC ATG GGC CGC CTA ACA GAG GA 23 
Met Gly Arg Leu Thr Glu Asp 



Claims 

1. A β1→4 N-acetylglucosaminyltransferase having an activity to produce a saccharide having a partial structure
represented by the formula below:

[Structural formula; legible fragments: GlcNAcβ1→4, Manβ1-4GlcNAc-, Manα1]



using UDP-GlcNAc as a sugar donor and a saccharide having a partial structure represented by the formula below
as a sugar receptor:

[Structural formula; legible fragments: Manβ1-4GlcNAc-, Manα]



2. The β1→4 N-acetylglucosaminyltransferase of claim 1, wherein the saccharide as a sugar receptor is an
oligosaccharide, polysaccharide, glycoconjugate (glycopeptide, glycoprotein, glycolipid or proteoglycan) or a
derivative thereof.

3. The β1→4 N-acetylglucosaminyltransferase of claim 1, wherein the sugar receptor is a saccharide having the
partial structure represented by the formula below:

[Structural formula; legible fragments: Manα1, Manβ1-4GlcNAc-R3, GlcNAcβ1-2Manα1]



wherein

R1: H-, Manα1→6 or GlcNAcβ1→6
R2: H-, Manα1→3 or GlcNAcβ1→2
R3: -OH or β1→4GlcNAc-R4
R4: -OH, -H, -pyridylamine or -peptide chain.



4. A β1→4 N-acetylglucosaminyltransferase comprising the amino acid sequences shown in SEQ ID NOS: 1-14 as
partial sequences.

5. A β1→4 N-acetylglucosaminyltransferase consisting of the amino acid sequence shown in SEQ ID NO: 18 or the
amino acid sequence shown in SEQ ID NO: 18 which has addition, deletion or substitution of one or more amino
acid residues and yet which produces β1→4 N-acetylglucosaminyltransferase activity.

6. A β1→4 N-acetylglucosaminyltransferase consisting of a partial sequence of the amino acid sequence shown in
SEQ ID NO: 18 including at least from position 93 to position 383 residues, or said partial sequence which has
addition, deletion or substitution of one or more amino acid residues and yet which produces β1→4
N-acetylglucosaminyltransferase activity.

7. A β1→4 N-acetylglucosaminyltransferase consisting of the amino acid sequence shown in SEQ ID NO: 24 or the
amino acid sequence shown in SEQ ID NO: 24 which has addition, deletion or substitution of one or more amino
acid residues and yet which produces β1→4 N-acetylglucosaminyltransferase activity.

8. A β1→4 N-acetylglucosaminyltransferase consisting of a partial sequence of the amino acid sequence shown in
SEQ ID NO: 24 including at least from position 94 to position 383 residues, or said partial sequence which has
addition, deletion or substitution of one or more amino acid residues and yet which produces β1→4
N-acetylglucosaminyltransferase activity.

9. A β1→4 N-acetylglucosaminyltransferase consisting of the amino acid sequence shown in SEQ ID NO: 37 or the
amino acid sequence shown in SEQ ID NO: 37 which has addition, deletion or substitution of one or more amino
acid residues and yet which produces β1→4 N-acetylglucosaminyltransferase activity.

10. A β1→4 N-acetylglucosaminyltransferase consisting of a partial sequence of the amino acid sequence shown in
SEQ ID NO: 37 including at least from position 91 to position 390 residues, or said partial sequence which has
addition, deletion or substitution of one or more amino acid residues and yet which produces β1→4
N-acetylglucosaminyltransferase activity.

11. A β1→4 N-acetylglucosaminyltransferase gene coding for the β1→4 N-acetylglucosaminyltransferase of claim 5
or 6.

12. A β1→4 N-acetylglucosaminyltransferase gene coding for the β1→4 N-acetylglucosaminyltransferase of claim 7
or 8.

13. A β1→4 N-acetylglucosaminyltransferase gene coding for the β1→4 N-acetylglucosaminyltransferase of claim 9
or 10.

14. A β1→4 N-acetylglucosaminyltransferase gene consisting of the nucleotide sequence shown in SEQ ID NO: 17.

15. A β1→4 N-acetylglucosaminyltransferase gene consisting of the nucleotide sequence shown in SEQ ID NO: 23.

16. A β1→4 N-acetylglucosaminyltransferase gene consisting of the nucleotide sequence shown in SEQ ID NO: 36.

17. A recombinant DNA obtained by inserting into a vector DNA the β1→4 N-acetylglucosaminyltransferase gene of
any one of claims 11 to 16.

18. A chromosomal fragment comprising a part or all of the β1→4 N-acetylglucosaminyltransferase gene of any one
of claims 11 to 16.

19. A host cell carrying the recombinant DNA of claim 17.

20. A host cell into which the chromosomal fragment of claim 18 is artificially introduced.

21. A method for purifying the β1→4 N-acetylglucosaminyltransferase of any one of claims 1 to 3 from biological
samples.

22. A method for producing a β1→4 N-acetylglucosaminyltransferase comprising culturing the host cell of claim 19 in
a medium and recovering the β1→4 N-acetylglucosaminyltransferase from the resultant culture.

23. A method for producing a β1→4 N-acetylglucosaminyltransferase comprising culturing the host cell of claim 20 in
a medium and recovering the β1→4 N-acetylglucosaminyltransferase from the resultant culture.

24. A method for producing a β1→4 N-acetylglucosaminyltransferase comprising recovering the β1→4
N-acetylglucosaminyltransferase from the secreta, body fluid or homogenate originated from the host cell of claim 19.

25. A method for producing a β1→4 N-acetylglucosaminyltransferase comprising recovering the β1→4
N-acetylglucosaminyltransferase from the secreta, body fluid or homogenate originated from the host cell of claim 20.

26. A saccharide of which the sugar chain structure is modified with the β1→4 N-acetylglucosaminyltransferase of any
one of claims 1 to 3.

27. The saccharide of claim 26, which is an oligosaccharide, a glycopeptide, a glycoprotein or a derivative thereof.

28. A method for modifying a branching structure of a sugar chain of a glycoprotein produced by a host cell, comprising
introducing into the host cell the β1→4 N-acetylglucosaminyltransferase gene of any one of claims 11 to 16.

29. A glycoprotein in which a branching structure of a sugar chain is modified by the method of claim 28. 

30. The glycoprotein of claim 29, which is erythropoietin.

FIG. 2

Variations of Asn-Linked Sugar Chains

Revised from Fig. 1, Makoto Takeuchi, Wako
Purechemical Newsletter 64, 18-19, 1996

FIG. 5

High Performance Liquid Chromatogram of
GnT-IV Reaction Product

FIG. 6

FIG. 8

UDP-Hexanolamine Agarose Affinity Chromatography I

FIG. 10

FIG. 11

FIG. 12

Activity (nmol / h / ml)
Native Gel Electrophoresis and Activity of purified GnT-IV

FIG. 16

FIG. 19







EP 0 905 232 A1 



INTERNATIONAL SEARCH REPORT 



International application No. 

PCT/JP97/04546 



A. CLASSIFICATION OF SUBJECT MATTER

Int. Cl.6 C12N9/10, C12N15/54, C12N5/10, C12N1/21, C12P21/02, C12P19/04

According to International Patent Classification (IPC) or to both national classification and IPC



B. FIELDS SEARCHED

Minimum documentation searched (classification system followed by classification symbols)

Int. Cl.6 C12N9/10, C12N15/54, C12N5/10, C12N1/21, C12P21/02, C12P19/04

Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched

Electronic data base consulted during the international search (name of data base and, where practicable, search terms used)
CA (STN), REGISTRY (STN), BIOSIS (DIALOG), WPI (DIALOG),
PIR/SWISS-PROT/GENESEQ



C. DOCUMENTS CONSIDERED TO BE RELEVANT

Category*: PA
Citation of document, with indication, where appropriate, of the relevant passages:
Glycoconjugate J., Vol. 14, No. 6 (1997), M. T. Minowa et al.,
"Cloning and expression of a newly purified
N-acetyglucosaminyl transferase", p. 767
Relevant to claim No.: 1-30

Category*: X/A
Citation of document, with indication, where appropriate, of the relevant passages:
Glycobiology, Vol. 1, No. 4 (1991), M. Takeuchi et al.,
"Structures and functional roles of the sugar chains
of human erythropoietins", p. 337-346
Relevant to claim No.: 26, 27, 29, 30/1-25, 28



[ ] Further documents are listed in the continuation of Box C.    [ ] See patent family annex.

* Special categories of cited documents:

"A" document defining the general state of the art which is not considered to be of particular relevance
"E" earlier document but published on or after the international filing date
"L" document which may throw doubts on priority claim(s) or which is cited to establish the publication date of
another citation or other special reason (as specified)
"O" document referring to an oral disclosure, use, exhibition or other means
"P" document published prior to the international filing date but later than the priority date claimed

"T" later document published after the international filing date or priority date and not in conflict with the
application but cited to understand the principle or theory underlying the invention
"X" document of particular relevance; the claimed invention cannot be considered novel or cannot be considered
to involve an inventive step when the document is taken alone
"Y" document of particular relevance; the claimed invention cannot be considered to involve an inventive step
when the document is combined with one or more other such documents, such combination being obvious to a
person skilled in the art
"&" document member of the same patent family



Date of the actual completion of the international search
March 10, 1998 (10. 03. 98)

Date of mailing of the international search report
March 24, 1998 (24. 03. 98)

Name and mailing address of the ISA/
Japanese Patent Office

Facsimile No.

Authorized officer

Telephone No.

Form PCT/ISA/210 (second sheet) (July 1992)






